RESUMEN
With the emergence of new CRISPR/dCas9 tools that enable site specific modulation of DNA methylation and histone modifications, more detailed investigations of the contribution of epigenetic regulation to the precise phenotype of cells in culture, including recombinant production subclones, is now possible. These also allow a wide range of applications in metabolic engineering once the impact of such epigenetic modifications on the chromatin state is available. In this study, enhanced DNA methylation tools were targeted to a recombinant viral promoter (CMV), an endogenous promoter that is silenced in its native state in CHO cells, but had been reactivated previously (ß-galactoside α-2,6-sialyltransferase 1) and an active endogenous promoter (α-1,6-fucosyltransferase), respectively. Comparative ChIP-analysis of histone modifications revealed a general loss of active promoter histone marks and the acquisition of distinct repressive heterochromatin marks after targeted methylation. On the other hand, targeted demethylation resulted in autologous acquisition of active promoter histone marks and loss of repressive heterochromatin marks. These data suggest that DNA methylation directs the removal or deposition of specific histone marks associated with either active, poised or silenced chromatin. Moreover, we show that de novo methylation of the CMV promoter results in reduced transgene expression in CHO cells. Although targeted DNA methylation is not efficient, the transgene is repressed, thus offering an explanation for seemingly conflicting reports about the source of CMV promoter instability in CHO cells. Importantly, modulation of epigenetic marks enables to nudge the cell into a specific gene expression pattern or phenotype, which is stabilized in the cell by autologous addition of further epigenetic marks. Such engineering strategies have the added advantage of being reversible and potentially tunable to not only turn on or off a targeted gene, but also to achieve the setting of a desirable expression level.
Asunto(s)
Infecciones por Citomegalovirus , Metilación de ADN , Animales , Células CHO , Cricetinae , Cricetulus , Metilación de ADN/genética , Epigénesis Genética/genética , Código de Histonas/genéticaRESUMEN
The existence of dynamic cellular phenotypes in changing environmental conditions is of major interest for cell biologists who aim to understand the mechanism and sequence of regulation of gene expression. In the context of therapeutic protein production by Chinese Hamster Ovary (CHO) cells, a detailed temporal understanding of cell-line behavior and control is necessary to achieve a more predictable and reliable process performance. Of particular interest are data on dynamic, temporally resolved transcriptional regulation of genes in response to altered substrate availability and culture conditions. In this study, the gene transcription dynamics throughout a 9-day batch culture of CHO cells was examined by analyzing histone modifications and gene expression profiles in regular 12- and 24-hr intervals, respectively. Three levels of regulation were observed: (a) the presence or absence of DNA methylation in the promoter region provides an ON/OFF switch; (b) a temporally resolved correlation is observed between the presence of active transcription- and promoter-specific histone marks and the expression level of the respective genes; and (c) a major mechanism of gene regulation is identified by interaction of coding genes with long non-coding RNA (lncRNA), as observed in the regulation of the expression level of both neighboring coding/lnc gene pairs and of gene pairs where the lncRNA is able to form RNA-DNA-DNA triplexes. Such triplex-forming regions were predominantly found in the promoter or enhancer region of the targeted coding gene. Significantly, the coding genes with the highest degree of variation in expression during the batch culture are characterized by a larger number of possible triplex-forming interactions with differentially expressed lncRNAs. This indicates a specific role of lncRNA-triplexes in enabling rapid and large changes in transcription. A more comprehensive understanding of these regulatory mechanisms will provide an opportunity for new tools to control cellular behavior and to engineer enhanced phenotypes.
Asunto(s)
Técnicas de Cultivo Celular por Lotes/métodos , Epigénesis Genética/genética , Regulación de la Expresión Génica/genética , Adaptación Fisiológica , Animales , Células CHO , Cricetinae , Cricetulus , Perfilación de la Expresión Génica , ARN Largo no Codificante/genética , TranscriptomaRESUMEN
Accurate and complete genome sequences are essential in biotechnology to facilitate genome-based cell engineering efforts. The current genome assemblies for Cricetulus griseus, the Chinese hamster, are fragmented and replete with gap sequences and misassemblies, consistent with most short-read-based assemblies. Here, we completely resequenced C. griseus using single molecule real time sequencing and merged this with Illumina-based assemblies. This generated a more contiguous and complete genome assembly than either technology alone, reducing the number of scaffolds by >28-fold, with 90% of the sequence in the 122 longest scaffolds. Most genes are now found in single scaffolds, including up- and downstream regulatory elements, enabling improved study of noncoding regions. With >95% of the gap sequence filled, important Chinese hamster ovary cell mutations have been detected in draft assembly gaps. This new assembly will be an invaluable resource for continued basic and pharmaceutical research.
Asunto(s)
Cricetulus/genética , Genoma , Secuenciación Completa del Genoma , Animales , Biología Computacional , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADNRESUMEN
Recent sequencing of the Chinese hamster ovary (CHO) cell and Chinese hamster genomes has dramatically advanced our ability to understand the biology of these mammalian cell factories. In this study, we focus on the powerhouse of the CHO cell, the mitochondrion. Utilizing a high-resolution next generation sequencing approach we sequenced the Chinese hamster mitochondrial genome for the first time and surveyed the mutational landscape of CHO cell mitochondrial DNA (mtDNA). Depths of coverage ranging from ~3,319X to 8,056X enabled accurate identification of low frequency mutations (>1%), revealing that mtDNA heteroplasmy is widespread in CHO cells. A total of 197 variants at 130 individual nucleotide positions were identified across a panel of 22 cell lines with 81% of variants occurring at an allele frequency of between 1% and 99%. 89% of the heteroplasmic mutations identified were cell line specific with the majority of shared heteroplasmic SNPs and INDELs detected in clones from 2 cell line development projects originating from the same host cell line. The frequency of common predicted loss of function mutations varied significantly amongst the clones indicating that heteroplasmic mtDNA variation could lead to a continuous range of phenotypes and play a role in cell to cell, production run to production run and indeed clone to clone variation in CHO cell metabolism. Experiments that integrate mtDNA sequencing with metabolic flux analysis and metabolomics have the potential to improve cell line selection and enhance CHO cell metabolic phenotypes for biopharmaceutical manufacturing through rational mitochondrial genome engineering.
Asunto(s)
Alelos , Frecuencia de los Genes , Genoma Mitocondrial , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación INDEL , Polimorfismo de Nucleótido Simple , Animales , Células CHO , Cricetinae , CricetulusRESUMEN
The human mitochondrial genome has been reported to have a very high mutation rate as compared with the nuclear genome. A large number of mitochondrial mutations show significant phenotypic association and are involved in a broad spectrum of diseases. In recent years, there has been a remarkable progress in the understanding of mitochondrial genetics. The availability of next-generation sequencing (NGS) technologies have not only reduced sequencing cost by orders of magnitude but has also provided us good quality mitochondrial genome sequences with high coverage, thereby enabling decoding of a number of human mitochondrial diseases. In this study, we report a computational and experimental pipeline to decipher the human mitochondrial DNA variations and examine them for their clinical correlation. As a proof of principle, we also present a clinical study of a patient with Leigh disease and confirmed maternal inheritance of the causative allele. The pipeline is made available as a user-friendly online tool to annotate variants and find haplogroup, disease association, and heteroplasmic sites. The "mit-o-matic" computational pipeline represents a comprehensive cloud-based tool for clinical evaluation of mitochondrial genomic variations from NGS datasets. The tool is freely available at http://genome.igib.res.in/mitomatic/.
Asunto(s)
Biología Computacional/métodos , Variación Genética , Genoma Mitocondrial , Genómica/métodos , Mitocondrias/genética , Programas Informáticos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Enfermedades Mitocondriales/diagnóstico , Enfermedades Mitocondriales/genética , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN/métodosRESUMEN
Patients with estrogen receptor-positive breast cancer receive adjuvant endocrine therapies (ET) that delay relapse by targeting clinically undetectable micrometastatic deposits. Yet, up to 50% of patients relapse even decades after surgery through unknown mechanisms likely involving dormancy. To investigate genetic and transcriptional changes underlying tumor awakening, we analyzed late relapse patients and longitudinally profiled a rare cohort treated with long-term neoadjuvant ETs until progression. Next, we developed an in vitro evolutionary study to record the adaptive strategies of individual lineages in unperturbed parallel experiments. Our data demonstrate that ETs induce nongenetic cell state transitions into dormancy in a stochastic subset of cells via epigenetic reprogramming. Single lineages with divergent phenotypes awaken unpredictably in the absence of recurrent genetic alterations. Targeting the dormant epigenome shows promising activity against adapting cancer cells. Overall, this study uncovers the contribution of epigenetic adaptation to the evolution of resistance to ETs. SIGNIFICANCE: This study advances the understanding of therapy-induced dormancy with potential clinical implications for breast cancer. Estrogen receptor-positive breast cancer cells adapt to endocrine treatment by entering a dormant state characterized by strong heterochromatinization with no recurrent genetic changes. Targeting the epigenetic rewiring impairs the adaptation of cancer cells to ETs. See related commentary by Llinas-Bertran et al., p. 704. This article is featured in Selected Articles from This Issue, p. 695.
Asunto(s)
Neoplasias de la Mama , Epigénesis Genética , Humanos , Neoplasias de la Mama/genética , Neoplasias de la Mama/patología , Neoplasias de la Mama/tratamiento farmacológico , Femenino , Recurrencia Local de Neoplasia/genética , Regulación Neoplásica de la Expresión GénicaRESUMEN
Tuberculosis (TB) is the second highest cause of mortality after HIV/AIDS and is one of the leading public health problems worldwide. The growing resistance to anti-TB drugs and the recalcitrant nature of tenacious infections present arduous challenges for the treatment of TB. Thus, the need to develop therapeutics against novel drug targets to help overcome multi-drug resistant TB is inevitable. Leader peptidase B (LepB), the sole signal peptidase of Mycobacterium tuberculosis (MTb), is one such potential drug target. The present work aims at identifying potential inhibitors of LepB, so as to repress the formation of the functional proteins essential for the growth and pathogenesis of MTb. In this study, we screened a large dataset of natural compounds against LepB using a high throughput approach. The screening was directed toward a binding pocket consisting of residues, some of which are critical for the catalytic activity of the enzyme, while others are part of the conserved domains of the signal peptidases. We also carried out molecular dynamics simulations of the two top-scoring compounds in order to study their molecular interactions with the active site functional residues of LepB and also to assess their dynamic behavior. We report herein two prospective non-covalent type inhibitory drugs of natural origin which are active against tuberculosis. These lead molecules possess improved binding properties, have low toxicity and are specific against MTb.
Asunto(s)
Antituberculosos/farmacología , Farmacorresistencia Bacteriana Múltiple , Proteínas de la Membrana/antagonistas & inhibidores , Mycobacterium tuberculosis/efectos de los fármacos , Secuencia de Aminoácidos , Antituberculosos/química , Proteínas Bacterianas/antagonistas & inhibidores , Proteínas Bacterianas/química , Productos Biológicos/química , Ácidos Cafeicos/farmacología , Dominio Catalítico , Cromonas/farmacología , Bases de Datos Farmacéuticas , Disacáridos/farmacología , Activación Enzimática , Ensayos Analíticos de Alto Rendimiento , Ligandos , Proteínas de la Membrana/química , Simulación de Dinámica Molecular , Mycobacterium tuberculosis/química , Mycobacterium tuberculosis/enzimología , Conformación Proteica , Mapeo de Interacción de Proteínas , Estabilidad Proteica , Estructura Terciaria de Proteína , Serina Endopeptidasas/químicaRESUMEN
Recombinant mammalian host cell lines, in particular CHO and HEK293 cells, are used for the industrial production of therapeutic proteins. Despite their well-known genomic instability, the control mechanisms that enable cells to respond to changes in the environmental conditions are not yet fully understood, nor do we have a good understanding of the factors that lead to phenotypic shifts in long-term cultures. A contributing factor could be inherent diversity in transcriptomes within a population. In this study, we used a full-length coverage single-cell RNA sequencing (scRNA-seq) approach to investigate and compare cell-to-cell variability and the impact of standardized and homogenous culture conditions on the diversity of individual cell transcriptomes, comparing suspension CHO-K1 and adherent HEK293FT cells. Our data showed a critical batch effect from the sequencing of four 96-well plates of CHO-K1 single cells stored for different periods of time, which was and may be therefore identified as a technical variable to consider in experimental planning. Besides, in an artificial and controlled culture environment such as used in routine cell culture technology, the gene expression pattern of a given population does not reveal any marker gene capable to disclose relevant cell population substructures, both for CHO-K1 cells and for HEK293FT cells. The variation observed is primarily driven by the cell cycle.
Asunto(s)
Transcriptoma , Cricetinae , Animales , Humanos , Transcriptoma/genética , Células HEK293 , Células CHO , Cricetulus , Análisis de Secuencia de ARNRESUMEN
Modulation of expression levels of endogenous or recombinant genes can be of great interest for diverse applications, such as the study of genotype-phenotype relationships for a gene of interest, or fine-tuning of transcription to determine physiologically relevant effects of gene expression levels. During the last decades, several synthetic biology tools were established to control gene expression in mammalian cells such as Chinese hamster ovary (CHO) cells, one of the most important cell systems for basic research as well as the production of biopharmaceuticals. Here we describe the use of triplex forming oligos (TFOs), short RNA or ssDNA molecules that can bind to the major grove of their target duplex with great specificity, to control transgene expression in CHO cells. For proof of concept, a panel of TFOs with a size of 10-20 nts were designed with the help of the on-line tool Triplexator targeting the viral cytomegalovirus (CMV) promoter/enhancer region controlling the downstream reporter gene hCD4. The effect of TFOs was tested as ssDNA oligos pre-annealed to the promoter/enhancer region in vitro as well as upon endogenous transcription of the TFO as an RNA molecule binding to their target duplex in vivo. Results showed that not only binding of the TFO, but the exact location of triplex formation within the promoter/enhancer is paramount for transcription inhibition. After relieving a binding conflict by introducing a point mutation within the CMV promoter, longer TFOs (26-30 nts) could be designed and analysed. Selected TFOs achieved a reduction in recombinant hCD4 expression of up to 85% in CHO-K1 cells.
Asunto(s)
Productos Biológicos , Infecciones por Citomegalovirus , Cricetinae , Animales , Oligonucleótidos , Cricetulus , Células CHO , Citomegalovirus/genética , ARN , Expresión GénicaRESUMEN
Chinese hamster ovary (CHO) cells are the most widely used host for the expression of therapeutic proteins. Recently, significant progress has been made due to advances in genome sequence and annotation quality to unravel the black box CHO. Nevertheless, in many cases the link between genotype and phenotype in the context of suspension cultivated production cell lines is still not fully understood. While frameshift approaches targeting coding genes are frequently used, the non-coding regions of the genome have received less attention with respect to such functional annotation. Importantly, for non-coding regions frameshift knock-out strategies are not feasible. In this study, we developed a CRISPR-mediated screening approach that performs full deletions of genomic regions to enable the functional study of both the translated and untranslated genome. An in silico pipeline for the computational high-throughput design of paired guide RNAs (pgRNAs) directing CRISPR/AsCpf1 was established and used to generate a library tackling process-related genes and long non-coding RNAs. Next generation sequencing analysis of the plasmid library revealed a sufficient, but highly variable pgRNA composition. Recombinase-mediated cassette exchange was applied for pgRNA library integration rather than viral transduction to ensure single copy representation of pgRNAs per cell. After transient AsCpf1 expression, cells were cultivated over two sequential batches to identify pgRNAs which massively affected growth and survival. By comparing pgRNA abundance, depleted candidates were identified and individually validated to verify their effect.
RESUMEN
Chinese Hamster Ovary (CHO) cell lines are considered to be the preferred platform for the production of biotherapeutics, but issues related to expression instability remain unresolved. In this study, we investigated potential causes for an unstable phenotype by comparing cell lines that express stably to such that undergo loss in titer across 10 passages. Factors related to transgene integrity and copy number as well as the genomic profile around the integration sites were analyzed. Horizon Discovery CHO-K1 (HD-BIOP3) derived production cell lines selected for phenotypes with low, medium or high copy number, each with stable and unstable transgene expression, were sequenced to capture changes at genomic and transcriptomic levels. The exact sites of the random integration events in each cell line were also identified, followed by profiling of the genomic, transcriptomic and epigenetic patterns around them. Based on the information deduced from these random integration events, genomic loci that potentially favor reliable and stable transgene expression were reported for use as targeted transgene integration sites. By comparing stable vs unstable phenotypes across these parameters, we could establish that expression stability may be controlled at three levels: 1) Good choice of integration site, 2) Ensuring integrity of transgene and observing concatemerization pattern after integration, and 3) Checking for potential stress related cellular processes. Genome wide favorable and unfavorable genomic loci for targeted transgene integration can be browsed at https://www.borthlabchoresources.boku.ac.at/.
RESUMEN
The increasingdemandfor biopharmaceutical products drives the search for efficient cell factories that are able to sustainably support rapid growth, high productivity, and product quality. As these depend on energy generation, here the genomic variation in nuclear genes associated with mitochondria and energy metabolism and the mitochondrial genome of 14 cell lines is investigated. The variants called enable reliable tracing of lineages. Unique sequence variations are observed in cell lines adapted to grow in protein-free media, enriched in signaling pathways or mitogen-activated protein kinase 3. High-producing cell lines bear unique mutations in nicotinamide adenine dinucleotide (NADH) dehydrogenase (ND2 and ND4) and in peroxisomal acyl-CoA synthetase (ACSL4), involved in lipid metabolism. As phenotypes are determined not only by functional mutations, but also by the exquisite regulation of expression patterns, it is not surprising that ≈50% of the genes investigated here are found to be differentially methylated and thus epigenetically controlled, enabling a clear distinction of high producers, and cells adapted to a minimal, glutamine (Gln)-free medium. Similar pathways are enriched as those identified by genome variation. This strengthens the hypothesis that these phenomena act together to define cell behavior.
Asunto(s)
Metabolismo Energético/genética , Epigénesis Genética/genética , Genoma/genética , Mitocondrias/fisiología , Animales , Células CHO , Linaje de la Célula/genética , Cricetinae , Cricetulus , Genómica , Mutación/genética , Fenotipo , Proteínas RecombinantesRESUMEN
For the industrial production of recombinant proteins in mammalian cell lines, a high rate of gene expression is desired. Therefore, strong viral promoters are commonly used. However, these have several drawbacks as they override cellular responses, are not integrated into the cellular network, and thus can induce stress and potentially epigenetic silencing. Endogenous promoters potentially have the advantage of a better response to cellular state and thus a lower stress level by uncontrolled overexpression of the transgene. Such fine-tuning is typically achieved by endogenous enhancers and other regulatory elements, which are difficult to identify purely based on the genomic sequence. Here, Chinese hamster ovary (CHO) endogenous promoters and enhancers are identified using histone marks and chromatin states, ranked based on expression level and tested for normalized promoter strength. Successive truncation of these promoters at the 5'- and 3'-end as well as the combination with enhancers are identified in the vicinity of the promoter sequence further enhance promoter activity up to threefold. In an initial screen within stable cell lines, the strongest CHO promoter appears to be more stable than the human cytomegalovirus promoter with enhancer, making it a promising candidate for recombinant protein production and cell engineering applications. A deeper understanding of promoter functionality and response elements will be required to take full advantage of such promoters for cell engineering, in particular, for multigene network engineering applications.
Asunto(s)
Células CHO , Expresión Génica , Mejoramiento Genético/métodos , Regiones Promotoras Genéticas , Proteínas Recombinantes/biosíntesis , Proteínas Recombinantes/genética , Animales , Técnicas de Cultivo de Célula , Ingeniería Celular , Clonación Molecular , Simulación por Computador , Cricetinae , Cricetulus , Epigénesis Genética , Escherichia coli/genética , Humanos , Técnicas In Vitro , Transgenes/genéticaRESUMEN
Despite great efforts to control and modify gene expression of Chinese Hamster Ovary (CHO) cells by conventional genetic engineering approaches, i.e. overexpression or knockdown/-out, subclonal variation, induced unknown regulatory effects as well as overexpression stress are still a major hurdle for efficient cell line engineering and for unequivocal characterization of gene function. The use of epigenetic modulators - key players in CHO clonal heterogeneity - has only been marginally addressed so far. Here, we present the application of an alternative engineering strategy in CHO cells by utilizing targeted epigenetic editing tools that enable the turning-on or -off of genes without altering the genomic sequence. The present, but silent beta-galactoside alpha-2,6-sialyltransferase 1 (ST6GAL1) gene is activated by targeting the catalytic domain (CD) of Ten-Eleven Translocation methylcytosine dioxygenase 1 (TET1) via deactivated Cas9 (dCas9) to its methylated promoter. Stable upregulation in up to 60% of transfected cells is achieved over a time span of more than 80 days. No difference in growth and recombinant protein productivity is observed between activated and control cultures. Re-silencing by targeted methylation via DNA methyltransferase (DNMT) 3A-CD resulted in an up to 5.4-fold reduction of ST6GAL1 mRNA expression in ST6GAL1 expressing cells. This proof-of-concept demonstrates the feasibility of using epigenetic editing tools to efficiently modulate gene expression and provide a promising complement to conventional genetic engineering in CHO cells.
Asunto(s)
Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Epigenómica/métodos , Galactósidos/genética , Edición Génica/métodos , Sialiltransferasas/genética , Animales , Biocatálisis , Células CHO , Cricetulus , Metilasas de Modificación del ADN/metabolismo , Escherichia coli , Expresión Génica , Ingeniería Genética/métodos , Genómica , Regiones Promotoras Genéticas/genética , Proteínas Recombinantes/genéticaRESUMEN
Recent transcriptome annotation using deep sequencing approaches have annotated a large number of long non-coding RNAs in zebrafish, a popular model organism for human diseases. These studies characterized lncRNAs in critical developmental stages as well as adult tissues. Each of the studies has uncovered a distinct set of lncRNAs, with minor overlaps. The availability of the raw RNA-Seq datasets in public domain encompassing critical developmental time-points and adult tissues provides us with a unique opportunity to understand the spatiotemporal expression patterns of lncRNAs. In the present report, we created a catalog of lncRNAs in zebrafish, derived largely from the three annotation sets, as well as manual curation of literature to compile a total of 2,267 lncRNA transcripts in zebrafish. The lncRNAs were further classified based on the genomic context and relationship with protein coding gene neighbors into 4 categories. Analysis revealed a total of 86 intronic, 309 promoter associated, 485 overlapping and 1,386 lincRNAs. We created a comprehensive resource which houses the annotation of lncRNAs as well as associated information including expression levels, promoter epigenetic marks, genomic variants and retroviral insertion mutants. The resource also hosts a genome browser where the datasets could be browsed in the genome context. To the best of our knowledge, this is the first comprehensive resource providing a unified catalog of lncRNAs in zebrafish. The resource is freely available at URL: http://genome.igib.res.in/zflncRNApedia.
Asunto(s)
Bases de Datos de Ácidos Nucleicos , Internet , ARN Largo no Codificante/genética , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Pez Cebra/genética , Animales , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Anotación de Secuencia Molecular , Regiones Promotoras Genéticas/genética , TranscriptomaRESUMEN
Mycobacterium tuberculosis, along with closely related species, commonly known as M. tuberculosis complex (MTBC), causes tuberculosis in humans and other organisms. Tuberculosis is a disease with high morbidity and mortality, especially in the third world. The genetic variability between clinical isolates of MTBC has been poorly understood, although recent years have seen the re-sequencing of a large number of clinical isolates of MTBC from around the world. The availability of genomic data of multiple isolates in public domain would potentially offer a unique opportunity toward understanding the variome of the organism and the functional consequences of the variations. This nevertheless has been limited by the lack of systematic curation and analysis of data sets available in public domain. In this report, we have re-analyzed re-sequencing data sets corresponding to >450 isolates of MTBC available in public domain to create a comprehensive variome map of MTBC comprising >29 000 single nucleotide variations. Using a systematic computational pipeline, we have annotated potential functional variants and drug-resistance-associated variants from the variome. We have made available this data set as a searchable database. Apart from a user-friendly interface, the database also has a novel option to annotate variants from clinical re-sequencing data sets of MTBC. To the best of our knowledge, tbvar is the largest and most comprehensive genome variation resources for MTBC. Database URL: http://genome.igib.res.in/tbvar/
Asunto(s)
Bases de Datos Genéticas , Variación Genética , Genoma Bacteriano/genética , Mycobacterium tuberculosis/genética , Programas Informáticos , Humanos , Internet , Anotación de Secuencia Molecular , Estadística como AsuntoRESUMEN
Tuberculosis (TB) is an infectious disease caused by fastidious pathogen Mycobacterium tuberculosis. TB has emerged as one of the major causes of mortality in the developing world. Role of host genetic factors that modulate disease susceptibility have not been studied widely. Recent studies have reported few genetic loci that provide impetus to this area of research. The availability of tools has enabled genome-wide scans for disease susceptibility loci associated with infectious diseases. Till now, information on human genetic variations and their associated genes that modulate TB susceptibility have not been systematically compiled. In this work, we have created a resource: HGV&TB, which hosts genetic variations reported to be associated with TB susceptibility in humans. It currently houses information on 307 variations in 98 genes. In total, 101 of these variations are exonic, whereas 78 fall in intronic regions. We also analysed the pathogenicity of the genetic variations, their phenotypic consequences and ethnic origin. Using various computational analyses, 30 variations of the 101 exonic variations were predicted to be pathogenic. The resource is freely available at http://genome.igib.res.in/hgvtb/index.html. Using integrative analysis, we have shown that the disease associated variants are selectively enriched in the immune signalling pathways which are crucial in the pathophysiology of TB. Database URL: http://genome.igib.res.in/hgvtb/index.html
Asunto(s)
Predisposición Genética a la Enfermedad , Variación Genética , Internet , Tuberculosis/genética , Mapeo Cromosómico , Bases de Datos Genéticas , Sitios Genéticos , Genética de Población , Genoma Humano/genética , Humanos , Fenotipo , Carácter Cuantitativo Heredable , Programas InformáticosRESUMEN
A large repertoire of gene-centric data has been generated in the field of zebrafish biology. Although the bulk of these data are available in the public domain, most of them are not readily accessible or available in nonstandard formats. One major challenge is to unify and integrate these widely scattered data sources. We tested the hypothesis that active community participation could be a viable option to address this challenge. We present here our approach to create standards for assimilation and sharing of information and a system of open standards for database intercommunication. We have attempted to address this challenge by creating a community-centric solution for zebrafish gene annotation. The Zebrafish GenomeWiki is a 'wiki'-based resource, which aims to provide an altruistic shared environment for collective annotation of the zebrafish genes. The Zebrafish GenomeWiki has features that enable users to comment, annotate, edit and rate this gene-centric information. The credits for contributions can be tracked through a transparent microattribution system. In contrast to other wikis, the Zebrafish GenomeWiki is a 'structured wiki' or rather a 'semantic wiki'. The Zebrafish GenomeWiki implements a semantically linked data structure, which in the future would be amenable to semantic search. Database URL: http://genome.igib.res.in/twiki.