Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 69
Filtrar
1.
Genes (Basel) ; 14(9)2023 09 13.
Artículo en Inglés | MEDLINE | ID: mdl-37761935

RESUMEN

We propose a computational framework for selecting biologically plausible genes identified by clustering of multi-omics data that reveal patients' similarity, thus giving researchers a more comprehensive view on any given disease. We employ spectral clustering of a similarity network created by fusion of three similarity networks, based on mRNA expression of immune genes, miRNA expression and DNA methylation data, using SNF_v2.1 software. For each cluster, we rank multi-omics features, ensuring the best separation between clusters, and select the top-ranked features that preserve clustering. To find genes targeted by DNA methylation and miRNAs found in the top-ranked features, we use chromosome-conformation capture data and miRNet2.0 software, respectively. To identify informative genes, these combined sets of target genes are analyzed in terms of their enrichment in somatic/germline mutations, GO biological processes/pathways terms and known sets of genes considered to be important in relation to a given disease, as recorded in the Molecular Signature Database from GSEA. The protein-protein interaction (PPI) networks were analyzed to identify genes that are hubs of PPI networks. We used data recorded in The Cancer Genome Atlas for patients with acute myeloid leukemia to demonstrate our approach, and discuss our findings in the context of results in the literature.


Asunto(s)
Leucemia Mieloide Aguda , MicroARNs , Humanos , Multiómica , Biología Computacional/métodos , Leucemia Mieloide Aguda/genética , MicroARNs/genética , MicroARNs/metabolismo , Programas Informáticos
2.
Aging (Albany NY) ; 11(23): 11244-11267, 2019 12 03.
Artículo en Inglés | MEDLINE | ID: mdl-31794428

RESUMEN

Despite a growing number of studies on longevity in Drosophila, genetic factors influencing lifespan are still poorly understood. In this paper we propose a conceptually new approach for the identification of novel longevity-associated genes and potential target genes for SNPs in non-coding regions by utilizing the knowledge of co-location of various loci, governed by the three-dimensional architecture of the Drosophila genome. Firstly, we created networks between genes/genomic regions harboring SNPs deemed to be significant in two longevity GWAS summary statistics datasets using intra- and inter-chromosomal interaction frequencies (Hi-C data) as a measure of co-location. These networks were further extended to include regions strongly interacting with previously selected regions. Using various network measures, literature search and additional bioinformatics resources, we investigated the plausibility of genes found to have genuine association with longevity. Several of the newly identified genes were common between the two GWAS datasets and these possessed human orthologs. We also found that the proportion of non-coding SNPs in borders between topologically associated domains is significantly higher than expected by chance. Assuming co-location, we investigated potential target genes for non-coding SNPs. This approach therefore offers a stepping stone to identification of novel genes and SNP targets linked to human longevity.


Asunto(s)
Proteínas de Drosophila/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/fisiología , Estudio de Asociación del Genoma Completo , Longevidad/genética , Animales , Análisis por Conglomerados , Bases de Datos de Ácidos Nucleicos , Proteínas de Drosophila/genética , Redes Reguladoras de Genes , Genoma de los Insectos , Polimorfismo de Nucleótido Simple
3.
Sci Rep ; 9(1): 17940, 2019 11 29.
Artículo en Inglés | MEDLINE | ID: mdl-31784692

RESUMEN

Genome-wide association studies identified numerous loci harbouring single nucleotide polymorphisms (SNPs) associated with various human diseases, although the causal role of many of them remains unknown. In this paper, we postulate that co-location and shared biological function of novel genes with genes known to associate with a specific phenotype make them potential candidates linked to the same phenotype ("guilt-by-proxy"). We propose a novel network-based approach for predicting candidate genes/genomic regions utilising the knowledge of the 3D architecture of the human genome and GWAS data. As a case study we used a well-studied polygenic disorder ‒ schizophrenia ‒ for which we compiled a comprehensive dataset of SNPs. Our approach revealed 634 novel regions covering ~398 Mb of the human genome and harbouring ~9000 genes. Using various network measures and enrichment analysis, we identified subsets of genes and investigated the plausibility of these genes/regions having an association with schizophrenia using literature search and bioinformatics resources. We identified several genes/regions with previously reported associations with schizophrenia, thus providing proof-of-concept, as well as novel candidates with no prior known associations. This approach has the potential to identify novel genes/genomic regions linked to other polygenic disorders and provide means of aggregating genes/SNPs for further investigation.


Asunto(s)
Esquizofrenia/genética , Cromosomas , Femenino , Redes Reguladoras de Genes , Predisposición Genética a la Enfermedad , Genoma Humano , Estudio de Asociación del Genoma Completo , Genómica , Humanos , Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo
4.
J Theor Biol ; 406: 99-104, 2016 10 07.
Artículo en Inglés | MEDLINE | ID: mdl-27354314

RESUMEN

Metabolic reaction data is commonly modelled using a complex network approach, whereby nodes represent the chemical species present within the organism of interest, and connections are formed between those nodes participating in the same chemical reaction. Unfortunately, such an approach provides an inadequate description of the metabolic process in general, as a typical chemical reaction will involve more than two nodes, thus risking oversimplification of the system of interest in a potentially significant way. In this paper, we employ a complex hypernetwork formalism to investigate the robustness of bacterial metabolic hypernetworks by extending the concept of a percolation process to hypernetworks. Importantly, this provides a novel method for determining the robustness of these systems and thus for quantifying their resilience to random attacks/errors. Moreover, we performed a site percolation analysis on a large cohort of bacterial metabolic networks and found that hypernetworks that evolved in more variable environments displayed increased levels of robustness and topological complexity.


Asunto(s)
Redes y Vías Metabólicas , Modelos Biológicos , Buchnera/metabolismo , Escherichia coli/metabolismo , Estadística como Asunto
5.
Hum Mutat ; 37(1): 65-73, 2016 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-26466920

RESUMEN

Missense/nonsense mutations and microdeletions/microinsertions (<21 bp) represent ∼ 76% of all mutations causing human inherited disease, and their occurrence has been associated with sequence motifs (direct, inverted, and mirror repeats; G-quartets) capable of adopting non-B DNA structures. We found that a significant proportion (∼ 21%) of both microdeletions and microinsertions occur within direct repeats, and are explicable by slipped misalignment. A novel mutational mechanism, DNA triplex formation followed by DNA repair, may explain ∼ 5% of microdeletions and microinsertions at mirror repeats. Further, G-quartets, direct, and inverted repeats also appear to play a prominent role in mediating missense mutations, whereas only direct and inverted repeats mediate nonsense mutations. We suggest a mutational mechanism involving slipped strand mispairing, slipped structure formation, and DNA repair, to explain ∼ 15% of missense and ∼ 12% of nonsense mutations yielding perfect direct repeats from imperfect repeats, or the extension of existing direct repeats. Similar proportions of missense and nonsense mutations were explicable by hairpin/loop formation and DNA repair, yielding perfect inverted repeats from imperfect repeats. We also propose a model for single base-pair substitution based on one-electron oxidation reactions at G-quadruplex DNA. Overall, the proposed mechanisms provide support for a role for non-B DNA structures in human gene mutagenesis.


Asunto(s)
ADN Forma B/genética , Estudios de Asociación Genética , Enfermedades Genéticas Congénitas/genética , Mutagénesis Insercional , Eliminación de Secuencia , Secuencia de Bases , Biología Computacional , ADN Forma B/química , Bases de Datos Genéticas , Conjuntos de Datos como Asunto , Humanos , Conformación de Ácido Nucleico , Secuencias Repetitivas de Ácidos Nucleicos
6.
Hum Genomics ; 9: 25, 2015 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-26446085

RESUMEN

BACKGROUND: Facioscapulohumeral dystrophy (FSHD) is commonly associated with contraction of the D4Z4 macro-satellite repeat on chromosome 4q35 (FSHD1) or mutations in the SMCHD1 gene (FSHD2). Recent studies have shown that the clinical manifestation of FSHD1 can be modified by mutations in the SMCHD1 gene within a given family. The absence of either D4Z4 contraction or SMCHD1 mutations in a small cohort of patients suggests that the disease could also be due to disruption of gene regulation. In this study, we postulated that mutations responsible for exerting a modifier effect on FSHD might reside within remotely acting regulatory elements that have the potential to interact at a distance with their cognate gene promoter via chromatin looping. To explore this postulate, genome-wide Hi-C data were used to identify genomic fragments displaying the strongest interaction with the SMCHD1 gene. These fragments were then narrowed down to shorter regions using ENCODE and FANTOM data on transcription factor binding sites and epigenetic marks characteristic of promoters, enhancers and silencers. RESULTS: We identified two regions, located respectively ~14 and ~85 kb upstream of the SMCHD1 gene, which were then sequenced in 229 FSHD/FSHD-like patients (200 with D4Z4 repeat units <11). Three heterozygous sequence variants were found ~14 kb upstream of the SMCHD1 gene. One of these variants was found to be of potential functional significance based on DNA methylation analysis. Further functional ascertainment will be required in order to establish the clinical/functional significance of the variants found. CONCLUSIONS: In this study, we propose an improved approach to predict the possible locations of remotely acting regulatory elements that might influence the transcriptional regulation of their associated gene(s). It represents a new way to screen for disease-relevant mutations beyond the immediate vicinity of the specific disease gene. It promises to be useful for investigating disorders in which mutations could occur in remotely acting regulatory elements.


Asunto(s)
Proteínas Cromosómicas no Histona/genética , Metilación de ADN/genética , Distrofia Muscular Facioescapulohumeral/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Secuencia de Bases , Simulación por Computador , Epigénesis Genética , Femenino , Humanos , Masculino , Distrofia Muscular Facioescapulohumeral/patología , Mutación/genética , Linaje
7.
Mol Biosyst ; 11(1): 77-85, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25325903

RESUMEN

At the systems level many organisms of interest may be described by their patterns of interaction, and as such, are perhaps best characterised via network or graph models. Metabolic networks, in particular, are fundamental to the proper functioning of many important biological processes, and thus, have been widely studied over the past decade or so. Such investigations have revealed a number of shared topological features, such as a short characteristic path-length, large clustering coefficient and hierarchical modular structure. However, the extent to which evolutionary and functional properties of metabolism manifest via this underlying network architecture remains unclear. In this paper, we employ a novel graph embedding technique, based upon low-order network motifs, to compare metabolic network structure for 383 bacterial species categorised according to a number of biological features. In particular, we introduce a new global significance score which enables us to quantify important evolutionary relationships that exist between organisms and their physical environments. Using this new approach, we demonstrate a number of significant correlations between environmental factors, such as growth conditions and habitat variability, and network motif structure, providing evidence that organism adaptability leads to increased complexities in the resultant metabolic networks.


Asunto(s)
Redes y Vías Metabólicas , Modelos Biológicos , Algoritmos , Fenómenos Fisiológicos Bacterianos , Ecosistema , Microbiología Ambiental , Consumo de Oxígeno
8.
Hum Genomics ; 7: 18, 2013 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-23947441

RESUMEN

Neurofibromatosis type 1 (NF1), a neuroectodermal disorder, is caused by germline mutations in the NF1 gene. NF1 affects approximately 1/3,000 individuals worldwide, with about 50% of cases representing de novo mutations. Although the NF1 gene was identified in 1990, the underlying gene mutations still remain undetected in a small but obdurate minority of NF1 patients. We postulated that in these patients, hitherto undetected pathogenic mutations might occur in regulatory elements far upstream of the NF1 gene. In an attempt to identify such remotely acting regulatory elements, we reasoned that some of them might reside within DNA sequences that (1) have the potential to interact at distance with the NF1 gene and (2) lie within a histone H3K27ac-enriched region, a characteristic of active enhancers. Combining Hi-C data, obtained by means of the chromosome conformation capture technique, with data on the location and level of histone H3K27ac enrichment upstream of the NF1 gene, we predicted in silico the presence of two remotely acting regulatory regions, located, respectively, approximately 600 kb and approximately 42 kb upstream of the NF1 gene. These regions were then sequenced in 47 NF1 patients in whom no mutations had been found in either the NF1 or SPRED1 gene regions. Five patients were found to harbour DNA sequence variants in the distal H3K27ac-enriched region. Although these variants are of uncertain pathological significance and still remain to be functionally characterized, this approach promises to be of general utility for the detection of mutations underlying other inherited disorders that may be caused by mutations in remotely acting regulatory elements.


Asunto(s)
Simulación por Computador , Pruebas Genéticas , Mutación/genética , Neurofibromatosis 1/genética , Neurofibromina 1/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Acetilación , Secuencia de Bases , Histonas/genética , Humanos , Lisina/metabolismo
9.
Hum Genomics ; 6: 12, 2012 Aug 13.
Artículo en Inglés | MEDLINE | ID: mdl-23244495

RESUMEN

Neurofibromatosis type 1 (NF1) is a complex neurocutaneous disorder with an increased susceptibility to develop both benign and malignant tumors but with a wide spectrum of inter and intrafamilial clinical variability. The establishment of genotype-phenotype associations in NF1 is potentially useful for targeted therapeutic intervention but has generally been unsuccessful, apart from small subsets of molecularly defined patients. The objective of this study was to evaluate the clinical phenotype associated with the specific types of NF1 mutation in a retrospectively recorded clinical dataset comprising 149 NF1 mutation-known individuals from unrelated families. Each patient was assessed for ten NF1-related clinical features, including the number of café-au-lait spots, cutaneous and subcutaneous neurofibromas and the presence/absence of intertriginous skin freckling, Lisch nodules, plexiform and spinal neurofibromas, optic gliomas, other neoplasms (in particular CNS gliomas, malignant peripheral nerve sheath tumors (MPNSTs), juvenile myelomonocytic leukemia, rhabdomyosarcoma, phaechromocytoma, gastrointestinal stromal tumors, juvenile xanthogranuloma, and lipoma) and evidence of learning difficulties. Gender and age at examination were also recorded. Patients were subcategorized according to their associated NF1 germ line mutations: frame shift deletions (52), splice-site mutations (23), nonsense mutations (36), missense mutations (32) and other types of mutation (6). A significant association was apparent between possession of a splice-site mutation and the presence of brain gliomas and MPNSTs (p = 0.006). If confirmed, these findings are likely to be clinically important since up to a third of NF1 patients harbor splice-site mutations. A significant influence of gender was also observed on the number of subcutaneous neurofibromas (females, p = 0.009) and preschool learning difficulties (females, p = 0.022).


Asunto(s)
Estudios de Asociación Genética , Neurofibromatosis 1/diagnóstico , Neurofibromina 1/genética , Sitios de Empalme de ARN , Adolescente , Adulto , Anciano , Manchas Café con Leche/complicaciones , Manchas Café con Leche/genética , Niño , Preescolar , Estudios de Cohortes , Femenino , Predisposición Genética a la Enfermedad , Mutación de Línea Germinal , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Mutación Missense , Neurofibroma/complicaciones , Neurofibroma/genética , Neurofibromatosis 1/complicaciones , Neurofibromatosis 1/genética , Glioma del Nervio Óptico/complicaciones , Glioma del Nervio Óptico/genética , Factores de Riesgo , Factores Sexuales , Adulto Joven
10.
PLoS One ; 7(11): e49455, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23166675

RESUMEN

BACKGROUND: Species of Cronobacter are widespread in the environment and are occasional food-borne pathogens associated with serious neonatal diseases, including bacteraemia, meningitis, and necrotising enterocolitis. The genus is composed of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. dublinensis, C. muytjensii, C. universalis, and C. condimenti. Clinical cases are associated with three species, C. malonaticus, C. turicensis and, in particular, with C. sakazakii multilocus sequence type 4. Thus, it is plausible that virulence determinants have evolved in certain lineages. METHODOLOGY/PRINCIPAL FINDINGS: We generated high quality sequence drafts for eleven Cronobacter genomes representing the seven Cronobacter species, including an ST4 strain of C. sakazakii. Comparative analysis of these genomes together with the two publicly available genomes revealed Cronobacter has over 6,000 genes in one or more strains and over 2,000 genes shared by all Cronobacter. Considerable variation in the presence of traits such as type six secretion systems, metal resistance (tellurite, copper and silver), and adhesins were found. C. sakazakii is unique in the Cronobacter genus in encoding genes enabling the utilization of exogenous sialic acid which may have clinical significance. The C. sakazakii ST4 strain 701 contained additional genes as compared to other C. sakazakii but none of them were known specific virulence-related genes. CONCLUSIONS/SIGNIFICANCE: Genome comparison revealed that pair-wise DNA sequence identity varies between 89 and 97% in the seven Cronobacter species, and also suggested various degrees of divergence. Sets of universal core genes and accessory genes unique to each strain were identified. These gene sequences can be used for designing genus/species specific detection assays. Genes encoding adhesins, T6SS, and metal resistance genes as well as prophages are found in only subsets of genomes and have contributed considerably to the variation of genomic content. Differences in gene content likely contribute to differences in the clinical and environmental distribution of species and sequence types.


Asunto(s)
Cronobacter/genética , Evolución Molecular , Genoma Bacteriano/genética , Filogenia , Sistemas de Secreción Bacterianos/genética , Secuencia de Bases , Cronobacter/patogenicidad , Fimbrias Bacterianas/genética , Funciones de Verosimilitud , Modelos Genéticos , Datos de Secuencia Molecular , Familia de Multigenes/genética , Análisis de Secuencia de ADN , Especificidad de la Especie , Factores de Virulencia/genética
11.
Hum Mutat ; 33(11): 1599-609, 2012 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-22837079

RESUMEN

Nonallelic homologous recombination (NAHR) is one of the major mechanisms underlying copy number variation in the human genome. Although several disease-associated meiotic NAHR breakpoints have been analyzed in great detail, hotspots for mitotic NAHR are not well characterized. Type-2 NF1 microdeletions, which are predominantly of postzygotic origin, constitute a highly informative model with which to investigate the features of mitotic NAHR. Here, a custom-designed MLPA- and PCR-based approach was used to identify 23 novel NAHR-mediated type-2 NF1 deletions. Breakpoint analysis of these 23 type-2 deletions, together with 17 NAHR-mediated type-2 deletions identified previously, revealed that the breakpoints are nonuniformly distributed within the paralogous SUZ12 and SUZ12P sequences. Further, the analysis of this large group of type-2 deletions revealed breakpoint recurrence within short segments (ranging in size from 57 to 253-bp) as well as the existence of a novel NAHR hotspot of 1.9-kb (termed PRS4). This hotspot harbored 20% (8/40) of the type-2 deletion breakpoints and contains the 253-bp recurrent breakpoint region BR6 in which four independent type-2 deletion breakpoints were identified. Our findings indicate that a combination of an open chromatin conformation and short non-B DNA-forming repeats may predispose to recurrent mitotic NAHR events between SUZ12 and its pseudogene.


Asunto(s)
Anomalías Craneofaciales/genética , Genes de Neurofibromatosis 1 , Discapacidad Intelectual/genética , Discapacidades para el Aprendizaje/genética , Neurofibromatosis/genética , Eliminación de Secuencia , Secuencia de Bases , Deleción Cromosómica , Cromosomas Humanos Par 17/genética , Roturas del ADN , Variaciones en el Número de Copia de ADN , Recombinación Homóloga , Humanos , Mitosis/genética , Datos de Secuencia Molecular , Mosaicismo , Reacción en Cadena de la Polimerasa Multiplex , Proteínas de Neoplasias , Neurofibromatosis 1/genética , Complejo Represivo Polycomb 2/genética , Seudogenes , Homología de Secuencia de Ácido Nucleico , Factores de Transcripción
12.
Eur J Hum Genet ; 20(4): 411-9, 2012 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-22108604

RESUMEN

Neurofibromatosis type-1 (NF1), caused by heterozygous inactivation of the NF1 tumour suppressor gene, is associated with the development of benign and malignant peripheral nerve sheath tumours (MPNSTs). Although numerous germline NF1 mutations have been identified, relatively few somatic NF1 mutations have been described in neurofibromas. Here we have screened 109 cutaneous neurofibromas, excised from 46 unrelated NF1 patients, for somatic NF1 mutations. NF1 mutation screening (involving loss-of-heterozygosity (LOH) analysis, multiplex ligation-dependent probe amplification and DNA sequencing) identified 77 somatic NF1 point mutations, of which 53 were novel. LOH spanning the NF1 gene region was evident in 25 neurofibromas, but in contrast to previous data from MPNSTs, it was absent at the TP53, CDKN2A and RB1 gene loci. Analysis of DNA/RNA from neurofibroma-derived Schwann cell cultures revealed NF1 mutations in four tumours whose presence had been overlooked in the tumour DNA. Bioinformatics analysis suggested that four of seven novel somatic NF1 missense mutations (p.A330T, p.Q519P, p.A776T, p.S1463F) could be of functional/clinical significance. Functional analysis confirmed this prediction for p.S1463F, located within the GTPase-activating protein-related domain, as this mutation resulted in a 150-fold increase in activated GTP-bound Ras. Comparison of the relative frequencies of the different types of somatic NF1 mutation observed with those of their previously reported germline counterparts revealed significant (P=0.001) differences. Although non-identical somatic mutations involving either the same or adjacent nucleotides were identified in three pairs of tumours from the same patients (P<0.0002), no association was noted between the type of germline and somatic NF1 lesion within the same individual.


Asunto(s)
Genes de Neurofibromatosis 1 , Mutación , Neurofibromatosis 1/genética , Neoplasias Cutáneas/genética , Secuencia de Bases , Humanos , Datos de Secuencia Molecular , Análisis de Secuencia de ADN , Células Tumorales Cultivadas
13.
Hum Mutat ; 33(2): 372-83, 2012 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-22045503

RESUMEN

Nonallelic homologous recombination (NAHR) is the major mechanism underlying recurrent genomic rearrangements, including the large deletions at 17q11.2 that cause neurofibromatosis type 1 (NF1). Here, we identify a novel NAHR hotspot, responsible for type-3 NF1 deletions that span 1.0 Mb. Breakpoint clustering within this 1-kb hotspot, termed PRS3, was noted in 10 of 11 known type-3 NF1 deletions. PRS3 is located within the LRRC37B pseudogene of the NF1-REPb and NF1-REPc low-copy repeats. In contrast to other previously characterized NAHR hotspots, PRS3 has not developed on a preexisting allelic homologous recombination hotspot. Furthermore, the variation pattern of PRS3 and its flanking regions is unusual since only NF1-REPc (and not NF1-REPb) is characterized by a high single nucleotide polymorphism (SNP) frequency, suggestive of unidirectional sequence transfer via nonallelic homologous gene conversion (NAHGC). By contrast, the previously described intense NAHR hotspots within the CMT1A-REPs, and the PRS1 and PRS2 hotspots underlying type-1 NF1 deletions, experience frequent bidirectional sequence transfer. PRS3 within NF1-REPc was also found to be involved in NAHGC with the LRRC37B gene, the progenitor locus of the LRRC37B-P duplicons, as indicated by the presence of shared SNPs between these loci. PRS3 therefore represents a weak (and probably evolutionarily rather young) NAHR hotspot with unique properties.


Asunto(s)
Eliminación de Gen , Genes de Neurofibromatosis 1 , Recombinación Homóloga , Neurofibromatosis 1/genética , Secuencia de Bases , Proteínas Portadoras/genética , Puntos de Rotura del Cromosoma , Conversión Génica , Orden Génico , Humanos , Mosaicismo , Motivos de Nucleótidos , Polimorfismo de Nucleótido Simple
14.
BMC Microbiol ; 11: 204, 2011 Sep 20.
Artículo en Inglés | MEDLINE | ID: mdl-21933417

RESUMEN

BACKGROUND: Cronobacter, formerly known as Enterobacter sakazakii, is a food-borne pathogen known to cause neonatal meningitis, septicaemia and death. Current diagnostic tests for identification of Cronobacter do not differentiate between species, necessitating time consuming 16S rDNA gene sequencing or multilocus sequence typing (MLST). The organism is ubiquitous, being found in the environment and in a wide range of foods, although there is variation in pathogenicity between Cronobacter isolates and between species. Therefore to be able to differentiate between the pathogenic and non-pathogenic strains is of interest to the food industry and regulators. RESULTS: Here we report the use of Expectation Maximization clustering to categorise 98 strains of Cronobacter as pathogenic or non-pathogenic based on biochemical test results from standard diagnostic test kits. Pathogenicity of a strain was postulated on the basis of either pathogenic symptoms associated with strain source or corresponding MLST sequence types, allowing the clusters to be labelled as containing either pathogenic or non-pathogenic strains. The resulting clusters gave good differentiation of strains into pathogenic and non-pathogenic groups, corresponding well to isolate source and MLST sequence type. The results also revealed a potential association between pathogenicity and inositol fermentation. An investigation of the genomes of Cronobacter sakazakii and C. turicensis revealed the gene for inositol monophosphatase is associated with putative virulence factors in pathogenic strains of Cronobacter. CONCLUSIONS: We demonstrated a computational approach allowing existing diagnostic kits to be used to identify pathogenic strains of Cronobacter. The resulting clusters correlated well with MLST sequence types and revealed new information about the pathogenicity of Cronobacter species.


Asunto(s)
Técnicas de Tipificación Bacteriana/métodos , Biología Computacional/métodos , Cronobacter/química , Cronobacter/clasificación , Infecciones por Enterobacteriaceae/microbiología , Inositol/metabolismo , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Cronobacter/metabolismo , Cronobacter/patogenicidad , Fermentación , Microbiología de Alimentos , Humanos , Datos de Secuencia Molecular , Virulencia
15.
Hum Genomics ; 5(4): 241-64, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21712188

RESUMEN

'Nonstop' mutations are single base-pair substitutions that occur within translational termination (stop) codons and which can lead to the continued and inappropriate translation of the mRNA into the 3'-untranslated region. We have performed a meta-analysis of the 119 nonstop mutations (in 87 different genes) known to cause human inherited disease, examining the sequence context of the mutated stop codons and the average distance to the next alternative in-frame stop codon downstream, in comparison with their counterparts from control (non-mutated) gene sequences. A paucity of alternative in-frame stop codons was noted in the immediate vicinity (0-49 nucleotides downstream) of the mutated stop codons as compared with their control counterparts (p = 7.81 × 10-4). This implies that at least some nonstop mutations with alternative stop codons in close proximity will not have come to clinical attention, possibly because they will have given rise to stable mRNAs (not subject to nonstop mRNA decay) that are translatable into proteins of near-normal length and biological function. A significant excess of downstream in-frame stop codons was, however, noted in the range 150-199 nucleotides from the mutated stop codon (p = 8.55 × 10-4). We speculate that recruitment of an alternative stop codon at greater distance from the mutated stop codon may trigger nonstop mRNA decay, thereby decreasing the amount of protein product and yielding a readily discernible clinical phenotype. Confirmation or otherwise of this postulate must await the emergence of a clearer understanding of the mechanism of nonstop mRNA decay in mammalian cells.


Asunto(s)
Codón de Terminación/genética , Enfermedades Genéticas Congénitas/genética , Mutación Puntual/genética , Regiones no Traducidas 3'/genética , Codón sin Sentido/genética , Genoma Humano , Humanos , Mutación Missense/genética , Sistemas de Lectura Abierta/genética , Biosíntesis de Proteínas
16.
Hum Mutat ; 32(6): 620-32, 2011 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-21432943

RESUMEN

Mutations associated with tumorigenesis may either arise somatically or can be inherited through the germline. We performed a comparison of somatic, germline, shared (found in both soma and germline) and somatic recurrent mutational spectra for 17 human tumor suppressor genes, which focused upon missense single base-pair substitutions and microdeletions/microinsertions. Somatic and germline mutational spectra were similar in relation to C.G>T.A transitions but differed with respect to the frequency of A.T>G.C, A.T>T.A, and C.G>A.T substitutions. Shared missense mutations were characterized by higher mutability rates, greater physicochemical differences between wild-type and mutant residues, and a tendency to occur in evolutionarily conserved residues and within CpG/CpHpG oligonucleotides. Mononucleotide runs (≥4 bp) were identified as hotspots for shared microdeletions/microinsertions. Both germline and somatic microdeletions/microinsertions were found to be significantly overrepresented within the "indel-hotspot" motif, GTAAGT. Using a naïve Bayes' classifier trained to discriminate between five missense mutation groups, 63% of mutations in our dataset were on average correctly recognized. Applying this classifier to an independent dataset of probable driver mutations, we concluded that ∼50% of these somatic missense mutations possess features consistent with their being either shared or recurrent, suggesting that a disproportionate number of such lesions are likely to be drivers of tumorigenesis.


Asunto(s)
Transformación Celular Neoplásica/genética , Mutación de Línea Germinal/genética , Mutación/genética , Neoplasias/genética , Proteínas Supresoras de Tumor/genética , Biología Computacional , Análisis Mutacional de ADN , Humanos , Mutación INDEL
17.
J Biol Chem ; 286(12): 10017-26, 2011 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-21285356

RESUMEN

Although alternative DNA secondary structures (non-B DNA) can induce genomic rearrangements, their associated mutational spectra remain largely unknown. The helicase activity of WRN, which is absent in the human progeroid Werner syndrome, is thought to counteract this genomic instability. We determined non-B DNA-induced mutation frequencies and spectra in human U2OS osteosarcoma cells and assessed the role of WRN in isogenic knockdown (WRN-KD) cells using a supF gene mutation reporter system flanked by triplex- or Z-DNA-forming sequences. Although both non-B DNA and WRN-KD served to increase the mutation frequency, the increase afforded by WRN-KD was independent of DNA structure despite the fact that purified WRN helicase was found to resolve these structures in vitro. In U2OS cells, ∼70% of mutations comprised single-base substitutions, mostly at G·C base-pairs, with the remaining ∼30% being microdeletions. The number of mutations at G·C base-pairs in the context of NGNN/NNCN sequences correlated well with predicted free energies of base stacking and ionization potentials, suggesting a possible origin via oxidation reactions involving electron loss and subsequent electron transfer (hole migration) between neighboring bases. A set of ∼40,000 somatic mutations at G·C base pairs identified in a lung cancer genome exhibited similar correlations, implying that hole migration may also be involved. We conclude that alternative DNA conformations, WRN deficiency and lung tumorigenesis may all serve to increase the mutation rate by promoting, through diverse pathways, oxidation reactions that perturb the electron orbitals of neighboring bases. It follows that such "hole migration" is likely to play a much more widespread role in mutagenesis than previously anticipated.


Asunto(s)
ADN de Forma Z/metabolismo , Exodesoxirribonucleasas , Inestabilidad Genómica , Neoplasias Pulmonares/metabolismo , RecQ Helicasas , Eliminación de Secuencia , Línea Celular Tumoral , ADN de Forma Z/genética , Técnicas de Silenciamiento del Gen , Humanos , Neoplasias Pulmonares/genética , Helicasa del Síndrome de Werner
18.
Hum Genomics ; 4(6): 384-93, 2010 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-20846927

RESUMEN

A 130 base pair (bp) insertion (g.-8delCins130) into the 5' untranslated region of the PAFAH1B1 (LIS1) gene, seven nucleotides upstream of the translational initiation site, was detected in an isolated case of lissencephaly. The inserted DNA sequence exhibited perfect homology to two non-contiguous regions of the mitochondrial genome (8479 to 8545 and 8775 to 8835, containing portions of two genes, ATP8 and ATP6 ), as well as near-perfect homology (1 bp mismatch) to a nuclear mitochondrial pseudogene (NUMT) sequence located on chromosome 1p36. This lesion was not evident on polymerase chain reaction (PCR) sequence analysis of either parent, indicating that the mutation had occurred de novo in the patient. Experiments designed to distinguish between a mitochondrial and a nuclear genomic origin for the inserted DNA sequence were, however, inconclusive. Mitochondrial genome sequences from both the patient and his parents were sequenced and found to be identical to the sequence inserted into the PAFAH1B1 gene. Analysis of parental PCR products from the chromosome 1-specific NUMT were also consistent with the interpretation that the inserted sequence had originated directly from the mitochondrial genome. The chromosome 1-specific NUMT in the patient proved to be refractory to PCR analysis, however, suggesting that this region of chromosome 1 could have been deleted or rearranged. Although it remains by far the most likely scenario, in the absence of DNA sequence information from the patient's own chromosome 1-specific NUMT, we cannot unequivocally confirm that the 130 bp insertion originated from mitochondrial genome rather than from the NUMT.


Asunto(s)
1-Alquil-2-acetilglicerofosfocolina Esterasa/genética , Regiones no Traducidas 5'/genética , ADN Mitocondrial/genética , Genoma Mitocondrial/genética , Lisencefalia/genética , Proteínas Asociadas a Microtúbulos/genética , Mutagénesis Insercional/genética , Emparejamiento Base/genética , Secuencia de Bases , Niño , Preescolar , Cromosomas Humanos Par 17/genética , Análisis Mutacional de ADN , Exones/genética , Femenino , Humanos , Lactante , Recién Nacido , Masculino , Datos de Secuencia Molecular , Embarazo , Secuencias Repetitivas de Ácidos Nucleicos/genética , Homología de Secuencia de Ácido Nucleico
19.
Hum Genomics ; 4(6): 406-10, 2010 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-20846930

RESUMEN

The cytosine-guanine (CpG) dinucleotide has long been known to be a hotspot for pathological mutation in the human genome. This hypermutability is related to its role as the major site of cytosine methylation with the attendant risk of spontaneous deamination of 5-methylcytosine (5mC) to yield thymine. Cytosine methylation, however, also occurs in the context of CpNpG sites in the human genome, an unsurprising finding since the intrinsic symmetry of CpNpG renders it capable of supporting a semi-conservative model of replication of the methylation pattern. Recently, it has become clear that significant DNA methylation occurs in a CpHpG context (where H = A, C or T) in a variety of human somatic tissues. If we assume that CpHpG methylation also occurs in the germline, and that 5mC deamination can occur within a CpHpG context, then we might surmise that methylated CpHpG sites could also constitute mutation hotspots causing human genetic disease. To test this postulate, 54,625 missense and nonsense mutations from 2,113 genes causing inherited disease were retrieved from the Human Gene Mutation Database (http://www.hgmd.org). Some 18.2 per cent of these pathological lesions were found to be C → T and G → A transitions located in CpG dinucleotides (compatible with a model of methylation-mediated deamination of 5mC), an approximately ten-fold higher proportion than would have been expected by chance alone. The corresponding proportion for the CpHpG trinucleotide was 9.9 per cent, an approximately two-fold higher proportion than would have been expected by chance. We therefore estimate that ∼5 per cent of missense/nonsense mutations causing human inherited disease may be attributable to methylation-mediated deamination of 5mC within a CpHpG context.


Asunto(s)
5-Metilcitosina/metabolismo , Metilación de ADN/genética , Fosfatos de Dinucleósidos/genética , Enfermedades Genéticas Congénitas/genética , Mutación/genética , Repeticiones de Trinucleótidos/genética , Bases de Datos de Ácidos Nucleicos , Desaminación , Humanos
20.
Hum Mutat ; 31(10): 1163-73, 2010 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-20725927

RESUMEN

Nonallelic homologous recombination (NAHR) is responsible for the recurrent rearrangements that give rise to genomic disorders. Although meiotic NAHR has been investigated in multiple contexts, much less is known about mitotic NAHR despite its importance for tumorigenesis. Because type-2 NF1 microdeletions frequently result from mitotic NAHR, they represent a good model in which to investigate the features of mitotic NAHR. We have used microsatellite analysis and SNP arrays to distinguish between the various alternative recombinational possibilities, thereby ascertaining that 17 of 18 type-2 NF1 deletions, with breakpoints in the SUZ12 gene and its highly homologous pseudogene, originated via intrachromosomal recombination. This high proportion of intrachromosomal NAHR causing somatic type-2 NF1 deletions contrasts with the interchromosomal origin of germline type-1 NF1 microdeletions, whose breakpoints are located within the NF1-REPs (low-copy repeats located adjacent to the SUZ12 sequences). Further, meiotic NAHR causing type-1 NF1 deletions occurs within recombination hotspots characterized by high GC-content and DNA duplex stability, whereas the type-2 breakpoints associated with the mitotic NAHR events investigated here do not cluster within hotspots and are located within regions of significantly lower GC-content and DNA stability. Our findings therefore point to fundamental mechanistic differences between the determinants of mitotic and meiotic NAHR.


Asunto(s)
Cromosomas Humanos Par 17/genética , Mitosis/genética , Neurofibromina 1/genética , Recombinación Genética , Eliminación de Secuencia , Proteínas Portadoras/genética , Biología Computacional , Genes de Neurofibromatosis 1 , Humanos , Repeticiones de Microsatélite/genética , Proteínas de Neoplasias , Neurofibromatosis 1/genética , Proteínas Nucleares/genética , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Complejo Represivo Polycomb 2 , Polimorfismo de Nucleótido Simple , Factores de Transcripción
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA