Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Front Genet ; 12: 739054, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34745213

RESUMEN

Detecting gene fusions involving driver oncogenes is pivotal in clinical diagnosis and treatment of cancer patients. Recent developments in next-generation sequencing (NGS) technologies have enabled improved assays for bioinformatics-based gene fusions detection. In clinical applications, where a small number of fusions are clinically actionable, targeted polymerase chain reaction (PCR)-based NGS chemistries, such as the QIAseq RNAscan assay, aim to improve accuracy compared to standard RNA sequencing. Existing informatics methods for gene fusion detection in NGS-based RNA sequencing assays traditionally use a transcriptome-based spliced alignment approach or a de-novo assembly approach. Transcriptome-based spliced alignment methods face challenges with short read mapping yielding low quality alignments. De-novo assembly-based methods yield longer contigs from short reads that can be more sensitive for genomic rearrangements, but face performance and scalability challenges. Consequently, there exists a need for a method to efficiently and accurately detect fusions in targeted PCR-based NGS chemistries. We describe SeekFusion, a highly accurate and computationally efficient pipeline enabling identification of gene fusions from PCR-based NGS chemistries. Utilizing biological samples processed with the QIAseq RNAscan assay and in-silico simulated data we demonstrate that SeekFusion gene fusion detection accuracy outperforms popular existing methods such as STAR-Fusion, TOPHAT-Fusion and JAFFA-hybrid. We also present results from 4,484 patient samples tested for neurological tumors and sarcoma, encompassing details on some novel fusions identified.

2.
Sci Rep ; 11(1): 21680, 2021 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-34737383

RESUMEN

The changing landscape of genomics research and clinical practice has created a need for computational pipelines capable of efficiently orchestrating complex analysis stages while handling large volumes of data across heterogeneous computational environments. Workflow Management Systems (WfMSs) are the software components employed to fill this gap. This work provides an approach and systematic evaluation of key features of popular bioinformatics WfMSs in use today: Nextflow, CWL, and WDL and some of their executors, along with Swift/T, a workflow manager commonly used in high-scale physics applications. We employed two use cases: a variant-calling genomic pipeline and a scalability-testing framework, where both were run locally, on an HPC cluster, and in the cloud. This allowed for evaluation of those four WfMSs in terms of language expressiveness, modularity, scalability, robustness, reproducibility, interoperability, ease of development, along with adoption and usage in research labs and healthcare settings. This article is trying to answer, which WfMS should be chosen for a given bioinformatics application regardless of analysis type?. The choice of a given WfMS is a function of both its intrinsic language and engine features. Within bioinformatics, where analysts are a mix of dry and wet lab scientists, the choice is also governed by collaborations and adoption within large consortia and technical support provided by the WfMS team/community. As the community and its needs continue to evolve along with computational infrastructure, WfMSs will also evolve, especially those with permissive licenses that allow commercial use. In much the same way as the dataflow paradigm and containerization are now well understood to be very useful in bioinformatics applications, we will continue to see innovations of tools and utilities for other purposes, like big data technologies, interoperability, and provenance.


Asunto(s)
Biología Computacional/métodos , Programas Informáticos , Flujo de Trabajo , Macrodatos , Genómica , Humanos , Reproducibilidad de los Resultados
3.
NAR Cancer ; 3(3): zcab028, 2021 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-34316715

RESUMEN

Acquired PARP inhibitor (PARPi) resistance in BRCA1- or BRCA2-mutant ovarian cancer often results from secondary mutations that restore expression of functional protein. RAD51C is a less commonly studied ovarian cancer susceptibility gene whose promoter is sometimes methylated, leading to homologous recombination (HR) deficiency and PARPi sensitivity. For this study, the PARPi-sensitive patient-derived ovarian cancer xenograft PH039, which lacks HR gene mutations but harbors RAD51C promoter methylation, was selected for PARPi resistance by cyclical niraparib treatment in vivo. PH039 acquired PARPi resistance by the third treatment cycle and grew through subsequent treatment with either niraparib or rucaparib. Transcriptional profiling throughout the course of resistance development showed widespread pathway level changes along with a marked increase in RAD51C mRNA, which reflected loss of RAD51C promoter methylation. Analysis of ovarian cancer samples from the ARIEL2 Part 1 clinical trial of rucaparib monotherapy likewise indicated an association between loss of RAD51C methylation prior to on-study biopsy and limited response. Interestingly, the PARPi resistant PH039 model remained platinum sensitive. Collectively, these results not only indicate that PARPi treatment pressure can reverse RAD51C methylation and restore RAD51C expression, but also provide a model for studying the clinical observation that PARPi and platinum sensitivity are sometimes dissociated.

4.
Front Genet ; 10: 736, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31481971

RESUMEN

As reliable, efficient genome sequencing becomes ubiquitous, the need for similarly reliable and efficient variant calling becomes increasingly important. The Genome Analysis Toolkit (GATK), maintained by the Broad Institute, is currently the widely accepted standard for variant calling software. However, alternative solutions may provide faster variant calling without sacrificing accuracy. One such alternative is Sentieon DNASeq, a toolkit analogous to GATK but built on a highly optimized backend. We conducted an independent evaluation of the DNASeq single-sample variant calling pipeline in comparison to that of GATK. Our results support the near-identical accuracy of the two software packages, showcase optimal scalability and great speed from Sentieon, and describe computational performance considerations for the deployment of DNASeq.

5.
Nat Cell Biol ; 21(4): 531-532, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30643186

RESUMEN

In the version of this Article originally published, Supplementary Fig. 6j showed incorrect values for the LS and AG4 glutathione samples, and Fig. 5c and Supplementary Fig. 6j did not include all n = 6 samples for the hESC, Y-hiPSC and AG4-ZSCAN10 groups as was stated in the legend. In addition, the bars for hESC, Y-hiPSC, AG4-ZCNAN10, AG4 and LS in Supplementary Fig. 6i and j have been reproduced from Fig. 5b and c, respectively. Fig. 6e was also reproduced in the lower panel of Supplementary Fig. 6h, to enable direct comparison of the data, however this was not explained in the original figure legends. The correct versions of these figures and their legends are shown below, and Supplementary Table 5 has been updated with the source data for all numerical data in the manuscript.

6.
Cell Rep ; 21(10): 2965-2977, 2017 Dec 05.
Artículo en Inglés | MEDLINE | ID: mdl-29212039

RESUMEN

Metabolism is an emerging stem cell hallmark tied to cell fate, pluripotency, and self-renewal, yet systems-level understanding of stem cell metabolism has been limited by the lack of genome-scale network models. Here, we develop a systems approach to integrate time-course metabolomics data with a computational model of metabolism to analyze the metabolic state of naive and primed murine pluripotent stem cells. Using this approach, we find that one-carbon metabolism involving phosphoglycerate dehydrogenase, folate synthesis, and nucleotide synthesis is a key pathway that differs between the two states, resulting in differential sensitivity to anti-folates. The model also predicts that the pluripotency factor Lin28 regulates this one-carbon metabolic pathway, which we validate using metabolomics data from Lin28-deficient cells. Moreover, we identify and validate metabolic reactions related to S-adenosyl-methionine production that can differentially impact histone methylation in naive and primed cells. Our network-based approach provides a framework for characterizing metabolic changes influencing pluripotency and cell fate.


Asunto(s)
Células Madre Pluripotentes/metabolismo , Animales , Diferenciación Celular/fisiología , Células Madre Embrionarias/metabolismo , Histonas/metabolismo , Metabolómica , Ratones , Transducción de Señal/fisiología , Biología de Sistemas
7.
Stem Cell Reports ; 9(4): 1053-1061, 2017 10 10.
Artículo en Inglés | MEDLINE | ID: mdl-29020613

RESUMEN

The RNA exosome complex targets AU-rich element (ARE)-containing mRNAs in eukaryotic cells. We identified a transcription factor, ZSCAN10, which binds to the promoters of multiple RNA exosome complex subunits in pluripotent stem cells to maintain subunit gene expression. We discovered that induced pluripotent stem cell clones generated from aged tissue donors (A-iPSC) show poor expression of ZSCAN10, leading to poor RNA exosome complex expression, and a subsequent elevation in ARE-containing RNAs, including glutathione peroxidase 2 (Gpx2). Excess GPX2 leads to excess glutathione-mediated reactive oxygen species scavenging activity that blunts the DNA damage response and apoptosis. Expression of ZSCAN10 in A-iPSC recovers RNA exosome gene expression, the DNA damage response, and apoptosis. These findings reveal the central role of ZSCAN10 and the RNA exosome complex in maintaining pluripotent stem cell redox status to support a normal DNA damage response.


Asunto(s)
Complejo Multienzimático de Ribonucleasas del Exosoma/metabolismo , Oxidación-Reducción , Células Madre Pluripotentes/metabolismo , Factores de Edad , Apoptosis/genética , Daño del ADN , Expresión Génica , Regulación de la Expresión Génica , Inestabilidad Genómica , Glutatión/metabolismo , Glutatión Peroxidasa/genética , Glutatión Peroxidasa/metabolismo , Homeostasis , Células Madre Pluripotentes Inducidas/metabolismo , Especies Reactivas de Oxígeno/metabolismo , Donantes de Tejidos , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
8.
Nat Cell Biol ; 19(9): 1037-1048, 2017 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-28846095

RESUMEN

Induced pluripotent stem cells (iPSCs), which are used to produce transplantable tissues, may particularly benefit older patients, who are more likely to suffer from degenerative diseases. However, iPSCs generated from aged donors (A-iPSCs) exhibit higher genomic instability, defects in apoptosis and a blunted DNA damage response compared with iPSCs generated from younger donors. We demonstrated that A-iPSCs exhibit excessive glutathione-mediated reactive oxygen species (ROS) scavenging activity, which blocks the DNA damage response and apoptosis and permits survival of cells with genomic instability. We found that the pluripotency factor ZSCAN10 is poorly expressed in A-iPSCs and addition of ZSCAN10 to the four Yamanaka factors (OCT4, SOX2, KLF4 and c-MYC) during A-iPSC reprogramming normalizes ROS-glutathione homeostasis and the DNA damage response, and recovers genomic stability. Correcting the genomic instability of A-iPSCs will ultimately enhance our ability to produce histocompatible functional tissues from older patients' own cells that are safe for transplantation.


Asunto(s)
Células Madre Adultas/metabolismo , Envejecimiento/metabolismo , Reprogramación Celular , Proteínas de Unión al ADN/metabolismo , Células Madre Embrionarias/metabolismo , Inestabilidad Genómica , Células Madre Pluripotentes Inducidas/metabolismo , Donantes de Tejidos , Factores de Transcripción/metabolismo , Células Madre Adultas/patología , Factores de Edad , Anciano , Envejecimiento/genética , Envejecimiento/patología , Animales , Animales Recién Nacidos , Apoptosis , Diferenciación Celular , Proliferación Celular , Células Cultivadas , Técnicas de Reprogramación Celular , Daño del ADN , Proteínas de Unión al ADN/genética , Células Madre Embrionarias/patología , Regulación del Desarrollo de la Expresión Génica , Edad Gestacional , Glutatión/metabolismo , Humanos , Células Madre Pluripotentes Inducidas/patología , Factor 4 Similar a Kruppel , Factores de Transcripción de Tipo Kruppel/genética , Factores de Transcripción de Tipo Kruppel/metabolismo , Ratones , Ratones Transgénicos , Factor 3 de Transcripción de Unión a Octámeros/genética , Factor 3 de Transcripción de Unión a Octámeros/metabolismo , Estrés Oxidativo , Fenotipo , Proteínas Proto-Oncogénicas c-myc/genética , Proteínas Proto-Oncogénicas c-myc/metabolismo , Especies Reactivas de Oxígeno/metabolismo , Factores de Transcripción SOXB1/genética , Factores de Transcripción SOXB1/metabolismo , Transducción de Señal , Factores de Transcripción/genética , Transfección
9.
Acta Neuropathol ; 134(5): 715-728, 2017 11.
Artículo en Inglés | MEDLINE | ID: mdl-28808785

RESUMEN

We previously found C9orf72-associated (c9ALS) and sporadic amyotrophic lateral sclerosis (sALS) brain transcriptomes comprise thousands of defects, among which, some are likely key contributors to ALS pathogenesis. We have now generated complementary methylome data and combine these two data sets to perform a comprehensive "multi-omic" analysis to clarify the molecular mechanisms initiating RNA misregulation in ALS. We found that c9ALS and sALS patients have generally distinct but overlapping methylome profiles, and that the c9ALS- and sALS-affected genes and pathways have similar biological functions, indicating conserved pathobiology in disease. Our results strongly implicate SERPINA1 in both C9orf72 repeat expansion carriers and non-carriers, where expression levels are greatly increased in both patient groups across the frontal cortex and cerebellum. SERPINA1 expression is particularly pronounced in C9orf72 repeat expansion carriers for both brain regions, where SERPINA1 levels are strictly down regulated across most human tissues, including the brain, except liver and blood, and are not measurable in E18 mouse brain. The altered biological networks we identified contain critical molecular players known to contribute to ALS pathology, which also interact with SERPINA1. Our comprehensive combined methylation and transcription study identifies new genes and highlights that direct genetic and epigenetic changes contribute to c9ALS and sALS pathogenesis.


Asunto(s)
Esclerosis Amiotrófica Lateral/genética , Proteína C9orf72/genética , Cerebelo/metabolismo , Metilación de ADN , Lóbulo Frontal/metabolismo , alfa 1-Antitripsina/genética , Esclerosis Amiotrófica Lateral/metabolismo , Esclerosis Amiotrófica Lateral/patología , Proteína C9orf72/metabolismo , Cerebelo/patología , Expansión de las Repeticiones de ADN , Exones , Lóbulo Frontal/patología , Humanos , alfa 1-Antitripsina/metabolismo
10.
Cell Stem Cell ; 19(1): 66-80, 2016 07 07.
Artículo en Inglés | MEDLINE | ID: mdl-27320042

RESUMEN

The RNA-binding proteins LIN28A and LIN28B play critical roles in embryonic development, tumorigenesis, and pluripotency, but their exact functions are poorly understood. Here, we show that, like LIN28A, LIN28B can function effectively with NANOG, OCT4, and SOX2 in reprogramming to pluripotency and that reactivation of both endogenous LIN28A and LIN28B loci are required for maximal reprogramming efficiency. In human fibroblasts, LIN28B is activated early during reprogramming, while LIN28A is activated later during the transition to bona fide induced pluripotent stem cells (iPSCs). In murine cells, LIN28A and LIN28B facilitate conversion from naive to primed pluripotency. Proteomic and metabolomic analysis highlighted roles for LIN28 in maintaining the low mitochondrial function associated with primed pluripotency and in regulating one-carbon metabolism, nucleotide metabolism, and histone methylation. LIN28 binds to mRNAs of proteins important for oxidative phosphorylation and modulates protein abundance. Thus, LIN28A and LIN28B play cooperative roles in regulating reprogramming, naive/primed pluripotency, and stem cell metabolism.


Asunto(s)
Proteínas de Unión al ADN/metabolismo , Células Madre Pluripotentes/citología , Células Madre Pluripotentes/metabolismo , Proteínas de Unión al ARN/metabolismo , Animales , Carbono/metabolismo , Reprogramación Celular , Fibroblastos/metabolismo , Histonas/metabolismo , Humanos , Metilación , Ratones , Nucleótidos/metabolismo , Oxidación-Reducción , Fosforilación Oxidativa , Proteoma/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo
11.
Nat Genet ; 48(5): 581-6, 2016 05.
Artículo en Inglés | MEDLINE | ID: mdl-26998690

RESUMEN

Antibiotic resistance is an increasingly serious public health threat. Understanding pathways allowing bacteria to survive antibiotic stress may unveil new therapeutic targets. We explore the role of the bacterial epigenome in antibiotic stress survival using classical genetic tools and single-molecule real-time sequencing to characterize genomic methylation kinetics. We find that Escherichia coli survival under antibiotic pressure is severely compromised without adenine methylation at GATC sites. Although the adenine methylome remains stable during drug stress, without GATC methylation, methyl-dependent mismatch repair (MMR) is deleterious and, fueled by the drug-induced error-prone polymerase Pol IV, overwhelms cells with toxic DNA breaks. In multiple E. coli strains, including pathogenic and drug-resistant clinical isolates, DNA adenine methyltransferase deficiency potentiates antibiotics from the ß-lactam and quinolone classes. This work indicates that the GATC methylome provides structural support for bacterial survival during antibiotic stress and suggests targeting bacterial DNA methylation as a viable approach to enhancing antibiotic activity.


Asunto(s)
Metilación de ADN , ADN Bacteriano/metabolismo , Farmacorresistencia Bacteriana/genética , Adenina/metabolismo , Escherichia coli/efectos de los fármacos , Escherichia coli/genética , Estrés Fisiológico
12.
Nat Neurosci ; 18(8): 1175-82, 2015 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-26192745

RESUMEN

Increasing evidence suggests that defective RNA processing contributes to the development of amyotrophic lateral sclerosis (ALS). This may be especially true for ALS caused by a repeat expansion in C9orf72 (c9ALS), in which the accumulation of RNA foci and dipeptide-repeat proteins are expected to modify RNA metabolism. We report extensive alternative splicing (AS) and alternative polyadenylation (APA) defects in the cerebellum of c9ALS subjects (8,224 AS and 1,437 APA), including changes in ALS-associated genes (for example, ATXN2 and FUS), and in subjects with sporadic ALS (sALS; 2,229 AS and 716 APA). Furthermore, heterogeneous nuclear ribonucleoprotein H (hnRNPH) and other RNA-binding proteins are predicted to be potential regulators of cassette exon AS events in both c9ALS and sALS. Co-expression and gene-association network analyses of gene expression and AS data revealed divergent pathways associated with c9ALS and sALS.


Asunto(s)
Esclerosis Amiotrófica Lateral/genética , Cerebelo/metabolismo , Lóbulo Frontal/metabolismo , Regulación de la Expresión Génica/genética , Proteínas/genética , ARN/metabolismo , Transcriptoma/genética , Adulto , Anciano , Empalme Alternativo , Proteína C9orf72 , Estudios de Asociación Genética , Ribonucleoproteína Heterogénea-Nuclear Grupo F-H/metabolismo , Humanos , Persona de Mediana Edad , Poliadenilación/genética , Análisis de Secuencia de ARN
13.
Circ Cardiovasc Genet ; 8(2): 261-9, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25583992

RESUMEN

BACKGROUND: Genetic studies of cardiomyopathy and heart failure have limited throughput in mammalian models. Adult zebrafish have been recently pursued as a vertebrate model with higher throughput, but genetic conservation must be tested. METHODS AND RESULTS: We conducted transcriptome analysis of zebrafish heart and searched for fish homologues of 51 known human dilated cardiomyopathy-associated genes. We also identified genes with high cardiac expression and genes with differential expression between embryonic and adult stages. Among tested genes, 30 had a single zebrafish orthologue, 14 had 2 homologues, and 5 had ≥3 homologues. By analyzing the expression data on the basis of cardiac abundance and enrichment hypotheses, we identified a single zebrafish gene for 14 of 19 multiple-homologue genes and 2 zebrafish homologues of high priority for ACTC1. Of note, our data suggested vmhc and vmhcl as functional zebrafish orthologues for human genes MYH6 and MYH7, respectively, which are established molecular markers for cardiac remodeling. CONCLUSIONS: Most known genes for human dilated cardiomyopathy have a corresponding zebrafish orthologue, which supports the use of zebrafish as a conserved vertebrate model. Definition of the cardiac transcriptome and fetal gene program will facilitate systems biology studies of dilated cardiomyopathy in zebrafish.


Asunto(s)
Cardiomiopatía Dilatada/metabolismo , Regulación de la Expresión Génica , Transcriptoma , Proteínas de Pez Cebra/biosíntesis , Pez Cebra/metabolismo , Animales , Biomarcadores/metabolismo , Modelos Animales de Enfermedad , Humanos
14.
PLoS One ; 9(3): e92877, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24675726

RESUMEN

We present a new approach for pathogen surveillance we call Geogenomics. Geogenomics examines the geographic distribution of the genomes of pathogens, with a particular emphasis on those mutations that give rise to drug resistance. We engineered a new web system called Geogenomic Mutational Atlas of Pathogens (GoMAP) that enables investigation of the global distribution of individual drug resistance mutations. As a test case we examined mutations associated with HIV resistance to FDA-approved antiretroviral drugs. GoMAP-HIV makes use of existing public drug resistance and HIV protein sequence data to examine the distribution of 872 drug resistance mutations in ∼ 502,000 sequences for many countries in the world. We also implemented a broadened classification scheme for HIV drug resistance mutations. Several patterns for geographic distributions of resistance mutations were identified by visual mining using this web tool. GoMAP-HIV is an open access web application available at http://www.bio-toolkit.com/GoMap/project/


Asunto(s)
Enfermedades Transmisibles/etiología , Bases de Datos Genéticas , Genoma Microbiano , Genómica/métodos , Mutación , Vigilancia de la Población/métodos , Navegador Web , Geografía , Salud Global , Infecciones por VIH , Humanos
15.
Nat Commun ; 4: 1854, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23673639

RESUMEN

OP9 is a yet-uncultivated bacterial lineage found in geothermal systems, petroleum reservoirs, anaerobic digesters and wastewater treatment facilities. Here we use single-cell and metagenome sequencing to obtain two distinct, nearly complete OP9 genomes, one constructed from single cells sorted from hot spring sediments and the other derived from binned metagenomic contigs from an in situ-enriched cellulolytic, thermophilic community. Phylogenomic analyses support the designation of OP9 as a candidate phylum for which we propose the name 'Atribacteria'. Although a plurality of predicted proteins is most similar to those from Firmicutes, the presence of key genes suggests a diderm cell envelope. Metabolic reconstruction from the core genome suggests an anaerobic lifestyle based on sugar fermentation by Embden-Meyerhof glycolysis with production of hydrogen, acetate and ethanol. Putative glycohydrolases and an endoglucanase may enable catabolism of (hemi)cellulose in thermal environments. This study lays a foundation for understanding the physiology and ecological role of the 'Atribacteria'.


Asunto(s)
Bacterias/citología , Bacterias/genética , Celulosa/metabolismo , Fermentación , Metagenómica/métodos , Filogenia , Análisis de la Célula Individual/métodos , Membrana Celular/metabolismo , Metabolismo Energético , Metagenoma/genética , Datos de Secuencia Molecular , Análisis de Componente Principal
16.
PLoS One ; 7(5): e35964, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22574130

RESUMEN

Over 100 hot spring sediment samples were collected from 28 sites in 12 areas/regions, while recording as many coincident geochemical properties as feasible (>60 analytes). PCR was used to screen samples for Korarchaeota 16S rRNA genes. Over 500 Korarchaeota 16S rRNA genes were screened by RFLP analysis and 90 were sequenced, resulting in identification of novel Korarchaeota phylotypes and exclusive geographical variants. Korarchaeota diversity was low, as in other terrestrial geothermal systems, suggesting a marine origin for Korarchaeota with subsequent niche-invasion into terrestrial systems. Korarchaeota endemism is consistent with endemism of other terrestrial thermophiles and supports the existence of dispersal barriers. Korarchaeota were found predominantly in >55°C springs at pH 4.7-8.5 at concentrations up to 6.6×10(6) 16S rRNA gene copies g(-1) wet sediment. In Yellowstone National Park (YNP), Korarchaeota were most abundant in springs with a pH range of 5.7 to 7.0. High sulfate concentrations suggest these fluids are influenced by contributions from hydrothermal vapors that may be neutralized to some extent by mixing with water from deep geothermal sources or meteoric water. In the Great Basin (GB), Korarchaeota were most abundant at spring sources of pH<7.2 with high particulate C content and high alkalinity, which are likely to be buffered by the carbonic acid system. It is therefore likely that at least two different geological mechanisms in YNP and GB springs create the neutral to mildly acidic pH that is optimal for Korarchaeota. A classification support vector machine (C-SVM) trained on single analytes, two analyte combinations, or vectors from non-metric multidimensional scaling models was able to predict springs as Korarchaeota-optimal or sub-optimal habitats with accuracies up to 95%. To our knowledge, this is the most extensive analysis of the geochemical habitat of any high-level microbial taxon and the first application of a C-SVM to microbial ecology.


Asunto(s)
Inteligencia Artificial , Biodiversidad , Fenómenos Ecológicos y Ambientales , Manantiales de Aguas Termales , Korarchaeota/clasificación , Filogeografía , Ecosistema , Manantiales de Aguas Termales/química , Concentración de Iones de Hidrógeno , Korarchaeota/genética , Temperatura , Agua/química
17.
Nucleic Acids Res ; 40(Database issue): D252-60, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22146221

RESUMEN

Minimotif Miner (MnM available at http://minimotifminer.org or http://mnm.engr.uconn.edu) is an online database for identifying new minimotifs in protein queries. Minimotifs are short contiguous peptide sequences that have a known function in at least one protein. Here we report the third release of the MnM database which has now grown 60-fold to approximately 300,000 minimotifs. Since short minimotifs are by their nature not very complex we also summarize a new set of false-positive filters and linear regression scoring that vastly enhance minimotif prediction accuracy on a test data set. This online database can be used to predict new functions in proteins and causes of disease.


Asunto(s)
Secuencias de Aminoácidos , Bases de Datos de Proteínas , Secuencia de Aminoácidos , Secuencia de Consenso , Modelos Biológicos , Mapas de Interacción de Proteínas , Proteínas/genética , Análisis de Secuencia de Proteína
18.
Res Microbiol ; 161(10): 830-7, 2010 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-20870021

RESUMEN

Some members of the orders Bacillales and Clostridiales form dormant spores when subjected to environmental stress. These resistant spores return to normal vegetative growth upon encountering nutrients. A family of membrane-bound proteins called germination (Ger) receptors is tasked with detecting metabolites that serve as germination signals. During the characterization of tripartite Ger receptor proteins from the genera Bacillus and Clostridium, we found numerous nomenclature inconsistencies. In this work, we issued guidelines to remediate this problem. We generated a sample of sequenced Bacillus, Clostridium and related endospore-forming genera. Ger receptor proteins were recovered from these genomes by PSI-BLAST. The resulting Ger receptor protein sequences were then clustered by neighbor-joining based on sequence alignment. Inconsistencies in Ger receptor labeling were noted and new names proposed. Finally, a systematic approach for naming of new Ger receptors was designed.


Asunto(s)
Bacillales/genética , Clostridium/genética , Proteínas de la Membrana/genética , Esporas Bacterianas/genética , Terminología como Asunto , Bacillales/crecimiento & desarrollo , Bacillales/metabolismo , Clostridium/crecimiento & desarrollo , Clostridium/metabolismo , Análisis por Conglomerados , Proteínas de la Membrana/metabolismo , Homología de Secuencia de Aminoácido , Esporas Bacterianas/crecimiento & desarrollo , Esporas Bacterianas/metabolismo
19.
J Bacteriol ; 192(13): 3321-8, 2010 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-20435731

RESUMEN

Adaptive (stationary phase) mutagenesis is a phenomenon by which nondividing cells acquire beneficial mutations as a response to stress. Although the generation of adaptive mutations is essentially stochastic, genetic factors are involved in this phenomenon. We examined how defects in a transcriptional factor, previously reported to alter the acquisition of adaptive mutations, affected mutation levels in a gene under selection. The acquisition of mutations was directly correlated to the level of transcription of a defective leuC allele placed under selection. To further examine the correlation between transcription and adaptive mutation, we placed a point-mutated allele, leuC427, under the control of an inducible promoter and assayed the level of reversion to leucine prototrophy under conditions of leucine starvation. Our results demonstrate that the level of Leu(+) reversions increased significantly in parallel with the induced increase in transcription levels. This mutagenic response was not observed under conditions of exponential growth. Since transcription is a ubiquitous biological process, transcription-associated mutagenesis may influence evolutionary processes in all organisms.


Asunto(s)
Bacillus subtilis/genética , Mutación/genética , Transcripción Genética/genética , Regulación Bacteriana de la Expresión Génica/genética , Leucina/deficiencia , Mutagénesis , Reacción en Cadena de la Polimerasa
20.
Appl Environ Microbiol ; 75(14): 4835-52, 2009 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-19465526

RESUMEN

The marine bacterium strain MC-1 is a member of the alpha subgroup of the proteobacteria that contains the magnetotactic cocci and was the first member of this group to be cultured axenically. The magnetotactic cocci are not closely related to any other known alphaproteobacteria and are only distantly related to other magnetotactic bacteria. The genome of MC-1 contains an extensive (102 kb) magnetosome island that includes numerous genes that are conserved among all known magnetotactic bacteria, as well as some genes that are unique. Interestingly, certain genes that encode proteins considered to be important in magnetosome assembly (mamJ and mamW) are absent from the genome of MC-1. Magnetotactic cocci exhibit polar magneto-aerotaxis, and the MC-1 genome contains a relatively large number of identified chemotaxis genes. Although MC-1 is capable of both autotrophic and heterotrophic growth, it does not appear to be metabolically versatile, with heterotrophic growth confined to the utilization of acetate. Central carbon metabolism is encoded by genes for the citric acid cycle (oxidative and reductive), glycolysis, and gluconeogenesis. The genome also reveals the presence or absence of specific genes involved in the nitrogen, sulfur, iron, and phosphate metabolism of MC-1, allowing us to infer the presence or absence of specific biochemical pathways in strain MC-1. The pathways inferred from the MC-1 genome provide important information regarding central metabolism in this strain that could provide insights useful for the isolation and cultivation of new magnetotactic bacterial strains, in particular strains of other magnetotactic cocci.


Asunto(s)
Alphaproteobacteria/genética , ADN Bacteriano/genética , Genoma Bacteriano , Análisis de Secuencia de ADN , Acetatos/metabolismo , Alphaproteobacteria/fisiología , ADN Bacteriano/química , Genes Bacterianos , Islas Genómicas , Locomoción , Magnetismo , Redes y Vías Metabólicas/genética , Datos de Secuencia Molecular
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...