RESUMEN
De novo mutations are known to play a prominent role in sporadic disorders with reduced fitness. We hypothesize that de novo mutations play an important role in severe male infertility and explain a portion of the genetic causes of this understudied disorder. To test this hypothesis, we utilize trio-based exome sequencing in a cohort of 185 infertile males and their unaffected parents. Following a systematic analysis, 29 of 145 rare (MAF < 0.1%) protein-altering de novo mutations are classified as possibly causative of the male infertility phenotype. We observed a significant enrichment of loss-of-function de novo mutations in loss-of-function-intolerant genes (p-value = 1.00 × 10-5) in infertile men compared to controls. Additionally, we detected a significant increase in predicted pathogenic de novo missense mutations affecting missense-intolerant genes (p-value = 5.01 × 10-4) in contrast to predicted benign de novo mutations. One gene we identify, RBM5, is an essential regulator of male germ cell pre-mRNA splicing and has been previously implicated in male infertility in mice. In a follow-up study, 6 rare pathogenic missense mutations affecting this gene are observed in a cohort of 2,506 infertile patients, whilst we find no such mutations in a cohort of 5,784 fertile men (p-value = 0.03). Our results provide evidence for the role of de novo mutations in severe male infertility and point to new candidate genes affecting fertility.
Asunto(s)
Azoospermia/genética , Proteínas de Ciclo Celular/genética , Proteínas de Unión al ADN/genética , Predisposición Genética a la Enfermedad , Mutación con Pérdida de Función , Mutación Missense , Oligospermia/genética , Proteínas de Unión al ARN/genética , Proteínas Supresoras de Tumor/genética , Adulto , Azoospermia/patología , Estudios de Casos y Controles , Proteínas de Ciclo Celular/deficiencia , Proteínas de Unión al ADN/deficiencia , Exoma , Expresión Génica , Perfilación de la Expresión Génica , Humanos , Masculino , Oligospermia/patología , Proteínas Supresoras de Tumor/deficiencia , Secuenciación del ExomaRESUMEN
Intrachromosomal amplification of chromosome 21 (iAMP21) identifies a high-risk subtype of acute lymphoblastic leukaemia (ALL), requiring intensive treatment to reduce their relapse risk. Improved understanding of the genomic landscape of iAMP21-ALL will ascertain whether these patients may benefit from targeted therapy. We performed whole-exome sequencing of eight iAMP21-ALL samples. The mutation rate was dramatically disparate between cases (average 24.9, range 5-51) and a large number of novel variants were identified, including frequent mutation of the RAS/MEK/ERK pathway. Targeted sequencing of a larger cohort revealed that 60% (25/42) of diagnostic iAMP21-ALL samples harboured 42 distinct RAS pathway mutations. High sequencing coverage demonstrated heterogeneity in the form of multiple RAS pathway mutations within the same sample and diverse variant allele frequencies (VAFs) (2-52%), similar to other subtypes of ALL. Constitutive RAS pathway activation was observed in iAMP21 samples that harboured mutations in the predominant clone (⩾35% VAF). Viable iAMP21 cells from primary xenografts showed reduced viability in response to the MEK1/2 inhibitor, selumetinib, in vitro. As clonal (⩾35% VAF) mutations were detected in 26% (11/42) of iAMP21-ALL, this evidence of response to RAS pathway inhibitors may offer the possibility to introduce targeted therapy to improve therapeutic efficacy in these high-risk patients.
Asunto(s)
Aberraciones Cromosómicas , Cromosomas Humanos Par 21 , Sistema de Señalización de MAP Quinasas/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Proteínas ras/metabolismo , Animales , Bencimidazoles/farmacología , Supervivencia Celular , Xenoinjertos , Humanos , Sistema de Señalización de MAP Quinasas/efectos de los fármacos , Ratones , Tasa de Mutación , Análisis de Secuencia de ADNRESUMEN
Autosomal recessive polycystic kidney disease (ARPKD) is an inherited ciliopathy leading to progressive kidney and liver disease. Biallelic mutations in the PKHD1 gene underlie this condition. We describe a child with bilaterally enlarged cystic kidneys, portal hypertension, and cerebral ventriculomegaly. Molecular genetic investigations using whole-exome sequencing and confirmation using Sanger sequencing revealed a homozygous pathogenic mutation in PKHD1 underlying the clinical phenotype of ARPKD. Whole-exome data analysis was used to search for additional rare variants in additional ciliopathy genes that may have contributed to the unusual brain phenotype. Aside from a rare hypomorphic allele in MKS1, no other pathogenic variants were detected. We conclude that the homozygous pathogenic mutation in PKHD1 underlies the ciliopathy phenotype in this patient.
Asunto(s)
Exoma/genética , Hidrocefalia/genética , Mutación Missense , Riñón Poliquístico Autosómico Recesivo/genética , Receptores de Superficie Celular/genética , Secuencia de Bases , Preescolar , Análisis Mutacional de ADN/métodos , Femenino , Homocigoto , Humanos , Hidrocefalia/patología , Riñón Poliquístico Autosómico Recesivo/patologíaRESUMEN
Sporadic late onset cerebellar ataxia is a well-described clinical presentation with a broad differential diagnosis that adult neurologists should be familiar with. However, despite extensive clinical investigations, an acquired cause is identified in only a minority of cases. Thereafter, an underlying genetic basis is often considered, even in those without a family history. Here we apply whole exome sequencing to a cohort of 12 patients with late onset cerebellar ataxia. We show that 33% of 'idiopathic' cases harbor compound heterozygous mutations in known ataxia genes, including genes not included on multi-gene panels, or primarily associated with an ataxic presentation.
Asunto(s)
Exoma/genética , Genes Recesivos/genética , Degeneraciones Espinocerebelosas/genética , Adulto , Anciano , Estudios de Cohortes , Inglaterra , Femenino , Humanos , Masculino , Persona de Mediana Edad , Mutación/genética , Tasa de Mutación , Análisis de Secuencia de ADNRESUMEN
OBJECTIVE: Resident macrophages play an important role in atheromatous plaque rupture. The macrophage gene expression signature associated with plaque rupture is incompletely defined due to the complex cellular heterogeneity in the plaque. We aimed to characterise differential gene expression in resident plaque macrophages from ruptured and stable human atheromatous lesions. METHODS AND RESULTS: We performed genome-wide expression analyses of isolated macrophage-rich regions of stable and ruptured human atherosclerotic plaques. Plaques present in carotid endarterectomy specimens were designated as stable or ruptured using clinical, radiological and histopathological criteria. Macrophage-rich regions were excised from 5 ruptured and 6 stable plaques by laser micro-dissection. Transcriptional profiling was performed using Affymetrix microarrays. The profiles were characteristic of activated macrophages. At a false discovery rate of 10%, 914 genes were differentially expressed between stable and ruptured plaques. The findings were confirmed in fourteen further stable and ruptured samples for a subset of eleven genes with the highest expression differences (p < 0.05). Pathway analysis revealed that components of the PPAR/Adipocytokine signaling pathway were the most significantly upregulated in ruptured compared to stable plaques (p = 5.4 × 10(-7)). Two key components of the pathway, fatty-acid binding-protein 4 (FABP4) and leptin, showed nine-fold (p = 0.0086) and five-fold (p = 0.0012) greater expression respectively in macrophages from ruptured plaques. CONCLUSIONS: We found differences in gene expression signatures between macrophages isolated from stable and ruptured human atheromatous plaques. Our findings indicate the involvement of FABP4 and leptin in the progression of atherosclerosis and plaque rupture, and suggest that down-regulation of PPAR/adipocytokine signaling within plaques may have therapeutic potential.
Asunto(s)
Proteínas de Unión a Ácidos Grasos/biosíntesis , Leptina/biosíntesis , Placa Aterosclerótica/metabolismo , Anciano , Proteínas de Unión a Ácidos Grasos/genética , Femenino , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Leptina/genética , Macrófagos/metabolismo , Masculino , Placa Aterosclerótica/complicaciones , Placa Aterosclerótica/genética , Rotura EspontáneaRESUMEN
OBJECTIVE: To define for the first time the transcriptomes of normal and end-stage osteoarthritis (OA) hip cartilage. MATERIALS AND METHODS: RNA was isolated from cartilage within 2h of joint replacement surgery. Gene expression was analyzed using Agilent GeneSpring GX 11 following hybridization to Illumina Human HT-12 V3 microarrays. Real-time reverse-transcription polymerase chain reaction (RT-PCR) was used to validate the expression of six genes identified by microarray as differentially expressed. Gene Set Enrichment Analysis (GSEA) and Ingenuity Pathway Analysis (IPA) were used to investigate enriched functions or canonical pathways amongst differentially expressed genes respectively. RESULTS: In total we identified 998 differentially expressed genes (fold change ≥ ±1.5, P-value ≤ 0.01) between neck of femur fracture (NOF) (n = 10) and OA hip (n = 9) patient cartilage. These differentially expressed genes were enriched within 71 canonical pathways. A comparison between a comparable knee dataset(20) only identified 229 genes similarly differentially expressed although remarkably 34 canonical pathways overlapped between experiments. CONCLUSIONS: This study is the first to report a comprehensive gene expression analysis of human hip OA cartilage compared to control (NOF) cartilage at the whole-genome level. Our differential gene expression dataset shows excellent correlation with similar defined studies using comparable tissue but reveals discord between hip and knee OA at the individual gene status but with commonality with regards the molecular pathways involved.
Asunto(s)
Cartílago Articular/metabolismo , Osteoartritis de la Cadera/genética , Osteoartritis de la Rodilla/genética , Transcriptoma/genética , Anciano , Femenino , Humanos , ARN/genética , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Vía de Señalización WntRESUMEN
Mononucleotide repeats (MNRs) are abundant in eukaryotic genomes and exhibit a high degree of length variability due to insertion and deletion events. However, the relationship between these repeats and mutation rates in surrounding sequences has not been systematically investigated. We have analyzed the frequency of single nucleotide polymorphisms (SNPs) at positions close to and within MNRs in the human genome. Overall, we find a 2- to 4-fold increase in the SNP frequency at positions immediately adjacent to the boundaries of MNRs, relative to that at more distant bases. This relationship exhibits a strong asymmetry between 3' and 5' ends of repeat tracts and is dependent upon the repeat motif, length and orientation of surrounding repeats. Our analysis suggests that the incorporation or exclusion of bases adjacent to the boundary of the repeat through substitutions, in which these nucleotides mutate towards or away from the base present within the repeat, respectively, may be another mechanism by which MNRs expand and contract in the human genome.
Asunto(s)
Genoma Humano , Repeticiones de Microsatélite , Polimorfismo de Nucleótido Simple , Humanos , Nucleótidos/químicaRESUMEN
Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.
Asunto(s)
Neuroblastoma/genética , Neuroblastoma/patología , Animales , Línea Celular Tumoral , Aberraciones Cromosómicas , Cromosomas Humanos Par 1 , Cromosomas Humanos Par 17 , Progresión de la Enfermedad , Amplificación de Genes , Dosificación de Gen , Regulación Neoplásica de la Expresión Génica , Humanos , Hibridación Fluorescente in Situ , Ratones , Proteína Proto-Oncogénica N-Myc , Proteínas Nucleares/biosíntesis , Proteínas Nucleares/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Proteínas Oncogénicas/biosíntesis , Proteínas Oncogénicas/genética , Polimorfismo de Nucleótido Simple , Ratas , Tasa de SupervivenciaRESUMEN
A significant proportion of the variation between individuals in gene expression levels is genetic, and it is likely that these differences correlate with phenotypic differences or with risk of disease. Cis-acting polymorphisms are important in determining interindividual differences in gene expression that lead to allelic expression imbalance, which is the unequal expression of homologous alleles in individuals heterozygous for such a polymorphism. This expression imbalance can be detected using a transcribed polymorphism, and, once it is established, the next step is to identify the polymorphisms that are responsible for or predictive of allelic expression levels. We present an expectation-maximization algorithm for such analyses, providing a formal statistical framework to test whether a candidate polymorphism is associated with allelic expression differences.
Asunto(s)
Algoritmos , Desequilibrio Alélico/genética , Expresión Génica , Polimorfismo Genético , HumanosRESUMEN
Aurora kinases are key regulators of chromosome segregation during mitosis. We have previously shown by microarray analysis of primary lung carcinomas and matched normal tissue that AURKB (22 out of 37) and AURKA (15 out of 37) transcripts are frequently over-represented in these tumours. We now confirm these observations in a second series of 44 carcinomas and also show that aurora B kinase protein levels are raised in the tumours compared to normal tissue. Elevated levels of expression in tumours are not a consequence of high-level amplification of the AURKB gene. Using a coding sequence polymorphism we show that in most cases (seven out of nine) tumour expression is predominantly driven from one AURKB allele. Given the function of aurora B kinase, we examined whether there was an association between expression levels and genetic instability. We defined two groups of high and low AURKB expression. Using a panel of 10 microsatellite markers, we found that the group showing the higher level of expression had a higher frequency of allelic imbalance (P=0.0012). Analysis of a number of other genes that are strongly and specifically expressed in tumour over normal lung, including SERPINB5, TERT and PRAME, showed marked allelic expression imbalances in the tumour tissue in the context of balanced or only marginally imbalanced relative allelic copy numbers. Our data support a model of early carcinogenesis wherein defects in the process of inactivation of lung stem-cell associated genes during differentiation, contributes to the development of carcinogenesis.
Asunto(s)
Desequilibrio Alélico , Carcinoma de Pulmón de Células no Pequeñas/enzimología , Regulación Enzimológica de la Expresión Génica , Neoplasias Pulmonares/enzimología , Proteínas Serina-Treonina Quinasas/genética , Antígenos de Neoplasias/genética , Antígenos de Neoplasias/metabolismo , Aurora Quinasa A , Aurora Quinasa B , Aurora Quinasas , Biomarcadores de Tumor/metabolismo , Western Blotting , Carcinoma de Pulmón de Células no Pequeñas/patología , Dominio Catalítico , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Células Epiteliales/metabolismo , Células Epiteliales/patología , Genes Supresores de Tumor , Humanos , Neoplasias Pulmonares/patología , Repeticiones de Microsatélite , Polimorfismo de Longitud del Fragmento de Restricción , Proteínas Serina-Treonina Quinasas/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Inhibidores de Serina Proteinasa/genética , Inhibidores de Serina Proteinasa/metabolismo , Serpinas/genética , Serpinas/metabolismo , Telomerasa/genética , Telomerasa/metabolismo , Regulación hacia ArribaRESUMEN
O6-Alkylguanine-DNA alkyltransferase (MGMT) confers resistance to many of the mutagenic and toxic effects of certain classes of alkylating agents by repairing the DNA lesions responsible. The levels of expression of this protein are of interest in relation to the prevention and treatment of cancer in man. They vary widely between individuals, and the basis of this variation is not understood. RT-PCR-RFLP analysis of mRNA from normal human lung tissue reveals that the two MGMT alleles are frequently expressed at different levels, indicating that there is a genetic component to inter-individual variation of MGMT levels and that at least some of this variation maps close to or within the MGMT locus.
Asunto(s)
Pulmón/fisiología , O(6)-Metilguanina-ADN Metiltransferasa/biosíntesis , O(6)-Metilguanina-ADN Metiltransferasa/genética , ARN Mensajero/análisis , Alelos , Reparación del ADN/genética , Humanos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Transcripción GenéticaRESUMEN
The identification of deleterious mutations within candidate genes is a crucial step in the elucidation of the genetic bases of human disease. However, the significance of any base or amino acid change within a gene is unknown until detailed structural and functional analysis has been carried out. A potentially rapid way of identifying functionally important sites within a gene is to identify evolutionarily conserved regions. Mutations affecting such sites are assumed to be deleterious for the carrier. In this communication we generalize this approach and present a formal framework to assess whether a specific mutation is deleterious given sequence data from a set of homologues. We propose a score that takes into account the nature of the mutation, the conservation of the affected residue among the different species, and their phylogenetic relationships. Its performance is examined using published TP53 mutations and frequent polymorphic variants.
Asunto(s)
Genes/genética , Enfermedades Genéticas Congénitas/genética , Mutación Missense , Filogenia , Biología Computacional/métodos , Biología Computacional/estadística & datos numéricos , Análisis Mutacional de ADN/métodos , Análisis Mutacional de ADN/estadística & datos numéricos , Bases de Datos Genéticas/estadística & datos numéricos , Evolución Molecular , Frecuencia de los Genes/genética , Genes p53/genética , Variación Genética , Humanos , Modelos Genéticos , Proteína p53 Supresora de Tumor/química , Proteína p53 Supresora de Tumor/clasificación , Proteína p53 Supresora de Tumor/genéticaAsunto(s)
Genoma , Repeticiones de Microsatélite/genética , Animales , Secuencia de Bases , Cebus , Genoma Humano , Gorilla gorilla , Humanos , Hylobates , Macaca fascicularis , Ratones , Datos de Secuencia Molecular , Pan paniscus , Pongo pygmaeus , RatasRESUMEN
Polyglutamine repeats within proteins are common in eukaryotes and are associated with neurological diseases in humans. Many are encoded by tandem repeats of the codon CAG that are likely to mutate primarily by replication slippage. However, a recent study in the yeast Saccharomyces cerevisiae has indicated that many others are encoded by mixtures of CAG and CAA which are less likely to undergo slippage. Here we attempt to estimate the proportions of polyglutamine repeats encoded by slippage-prone structures in species currently the subject of genome sequencing projects. We find a general excess over random expectation of polyglutamine repeats encoded by tandem repeats of codons. We nevertheless find many repeats encoded by nontandem codon structures. Mammals and Drosophila display extreme opposite patterns. Drosophila contains many proteins with polyglutamine tracts but these are generally encoded by interrupted structures. These structures may have been selected to be resistant to slippage. In contrast, mammals (humans and mice) have a high proportion of proteins in which repeats are encoded by tandem codon structures. In humans, these include most of the triplet expansion disease genes.
Asunto(s)
Evolución Molecular , Péptidos/genética , Secuencias Repetidas en Tándem/genética , Animales , Codón/genética , Bases de Datos Factuales , Drosophila melanogaster/genética , Humanos , RatonesRESUMEN
The evolutionary expansion of CAG repeats in human triplet expansion disease genes is intriguing because of their deleterious phenotype. In the past, this expansion has been suggested to reflect a broad genomewide expansion of repeats, which would imply that mutational and evolutionary processes acting on repeats differ between species. Here, we tested this hypothesis by analyzing repeat- and flanking-sequence evolution in 28 repeat-containing genes that had been sequenced in humans and mice and by considering overall lengths and distributions of CAG repeats in the two species. We found no evidence that these repeats were longer in humans than in mice. We also found no evidence for preferential accumulation of CAG repeats in the human genome relative to mice from an analysis of the lengths of repeats identified in sequence databases. We then investigated whether sequence properties, such as base and amino acid composition and base substitution rates, showed any relationship to repeat evolution. We found that repeat-containing genes were enriched in certain amino acids, presumably as the result of selection, but that this did not reflect underlying biases in base composition. We also found that regions near repeats showed higher nonsynonymous substitution rates than the remainder of the gene and lower nonsynonymous rates in genes that contained a repeat in both the human and the mouse. Higher rates of nonsynonymous mutation in the neighborhood of repeats presumably reflect weaker purifying selection acting in these regions of the proteins, while the very low rate of nonsynonymous mutation in proteins containing a CAG repeat in both species presumably reflects a high level of purifying selection. Based on these observations, we propose that the mutational processes giving rise to polyglutamine repeats in human and murine proteins do not differ. Instead, we propose that the evolution of polyglutamine repeats in proteins results from an interplay between mutational processes and selection.
Asunto(s)
Evolución Molecular , Selección Genética , Repeticiones de Trinucleótidos/genética , Aminoácidos , Animales , Composición de Base , Codón , Bases de Datos Factuales , Enfermedad/etiología , Variación Genética , Humanos , Ratones , Expansión de Repetición de Trinucleótido/genéticaRESUMEN
Long amino acid repeats are often observed in eukaryotic proteins. In humans, several neurological disorders are caused by proteins containing abnormally long polyglutamines. However, no systematic analysis has attempted to investigate the relationship between reiterations of particular amino acids and protein function, the possible mechanisms involved in the generation of these regions, or the contribution of selection in restricting their genomic distribution, in a large collection of wild-type proteins. We have used baker's yeast open reading frames to study these questions. The most abundant amino acid repeats found in yeast proteins are repeats of glutamine, asparagine, aspartic acid, glutamic acid, and serine. Different amino acid repeats are concentrated in different classes of proteins. Acidic and polar amino acid repeats are significantly associated with transcription factors and protein kinases, while serine repeats are significantly associated with membrane transporter proteins. In most cases the codon structures encoding the repeats at the gene level show a significant bias toward long tracts of one of the possible codons, suggesting that trinucleotide slippage has played an important role in generating these reiterations. However, many, particularly those encoding serine repeats, do not show evidence of slippage. The distributions of codon repeats within proteins and between coding and noncoding regions of the genome, and of amino acids between proteins with different functions, suggest that repeats of these kinds are subject to strong selection.
Asunto(s)
Proteínas Fúngicas/química , Proteínas Fúngicas/clasificación , Modelos Genéticos , Mutagénesis/genética , Secuencias Repetitivas de Aminoácido/genética , Saccharomyces cerevisiae/genética , Secuencia de Bases , Codón/genética , Bases de Datos Factuales , Evolución Molecular , Proteínas Fúngicas/genética , Frecuencia de los Genes , Genoma Fúngico , Sistemas de Lectura Abierta/genética , Péptidos/química , Péptidos/genética , Proteínas Quinasas/química , Proteínas Quinasas/genética , Saccharomyces cerevisiae/química , Selección Genética , Electricidad Estática , Relación Estructura-Actividad , Secuencias Repetidas en Tándem/genética , Factores de Transcripción/química , Factores de Transcripción/genéticaRESUMEN
Recently, data on loss of constitutional heterozygosity (LOH) have been used by several groups to increase the power to detect linkage in pedigrees with an inherited cancer predisposition. This approach assumes that the predisposition is due to the inheritance of the defective copy of a tumor suppressor. In order to assess the gain of power expected from the inclusion of LOH data, we simulated segregation and somatic loss of alleles in pedigrees consisting of an affected pair of first-degree relatives. We explored the effects of pedigree structure, frequency of loss, penetrance, and recombination rate on the expected LOD score. The results indicate that, for establishment of genetic linkage, isolated parent-offspring pairs can be as informative as sib pairs and that they could represent an additional source of information in linkage studies.