Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Microb Cell Fact ; 16(1): 63, 2017 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-28420406

RESUMEN

BACKGROUND: Efficient biomass bioconversion is a promising solution to alternative energy resources and environmental issues associated with lignocellulosic wastes. The Trichoderma species of cellulolytic fungi have strong cellulose-degrading capability, and their cellulase systems have been extensively studied. Currently, a major limitation of Trichoderma strains is their low production of ß-glucosidases. RESULTS: We isolated two Trichoderma hamatum strains YYH13 and YYH16 with drastically different cellulose degrading efficiencies. YYH13 has higher cellobiose-hydrolyzing efficiency. To understand mechanisms underlying such differences, we sequenced the genomes of YYH13 and YYH16, which are essentially identical (38.93 and 38.92 Mb, respectively) and are similar to that of the T. hamatum strain GD12. Using GeneMark-ES, we annotated 11,316 and 11,755 protein-coding genes in YYH13 and YYH16, respectively. Comparative analysis identified 13 functionally important genes in YYH13 under positive selection. Through examining orthologous relationships, we identified 172,655, and 320 genome-specific genes in YYH13, YYH16, and GD12, respectively. We found 15 protease families that show differences between YYH13 and YYH16. Enzymatic tests showed that exoglucanase, endoglucanase, and ß-glucosidase activities were higher in YYH13 than YYH16. Additionally, YYH13 contains 10 families of carbohydrate-active enzymes, including GH1, GH3, GH18, GH35, and GH55 families of chitinases, glucosidases, galactosidases, and glucanases, which are subject to stronger positive selection pressure. Furthermore, we found that the ß-glucosidase gene (YYH1311079) and pGEX-KG/YYH1311079 bacterial expression vector may provide valuable insight for designing ß-glucosidase with higher cellobiose-hydrolyzing efficiencies. CONCLUSIONS: This study suggests that the YYH13 strain of T. hamatum has the potential to serve as a model organism for producing cellulase because of its strong ability to efficiently degrade cellulosic biomass. The genome sequences of YYH13 and YYH16 represents a valuable resource for studying efficient production of biofuels.


Asunto(s)
Celobiosa/metabolismo , Genoma Fúngico , Trichoderma/genética , Trichoderma/metabolismo , Biocombustibles , Biomasa , Celulasa/biosíntesis , Celulasa/genética , Celulasa/metabolismo , Celulosa/metabolismo , Fermentación , Variación Genética , Genómica , Hidrólisis , Péptido Hidrolasas/genética , Péptido Hidrolasas/metabolismo , Análisis de Secuencia de ADN , Trichoderma/enzimología , beta-Glucosidasa/genética , beta-Glucosidasa/metabolismo
2.
BMC Genomics ; 16: 1039, 2015 Dec 09.
Artículo en Inglés | MEDLINE | ID: mdl-26645802

RESUMEN

BACKGROUND: The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in common wheat developing caryopses using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research. RESULTS: We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6 % of which were estimated to carry complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat. CONCLUSIONS: Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.


Asunto(s)
Grano Comestible/genética , Genoma de Planta , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Transcriptoma , Triticum/genética , Biología Computacional/métodos , Mapeo Contig , Regulación del Desarrollo de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Sitios Genéticos , Genómica/métodos , Glútenes/genética , Sistemas de Lectura Abierta
3.
Genome Res ; 22(8): 1567-80, 2012 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-22772596

RESUMEN

Curation of a high-quality gene set is the critical first step in genome research, enabling subsequent analyses such as ortholog assignment, cis-regulatory element finding, and synteny detection. In this project, we have reannotated the genome of Caenorhabditis briggsae, the best studied sister species of the model organism Caenorhabditis elegans. First, we applied a homology-based gene predictor genBlastG to annotate the C. briggsae genome. We then validated and further improved the C. briggsae gene annotation through RNA-seq analysis of the C. briggsae transcriptome, which resulted in the first validated C. briggsae gene set (23,159 genes), among which 7347 genes (33.9% of all genes with introns) have all of their introns confirmed. Most genes (14,812, or 68.3%) have at least one intron validated, compared with only 3.9% in the most recent WormBase release (WS228). Of all introns in the revised gene set (103,083), 61,503 (60.1%) have been confirmed. Additionally, we have identified numerous trans-splicing leaders (SL1 and SL2 variants) in C. briggsae, leading to the first genome-wide annotation of operons in C. briggsae (1105 operons). The majority of the annotated operons (564, or 51.0%) are perfectly conserved in C. elegans, with an additional 345 operons (or 31.2%) somewhat divergent. Additionally, RNA-seq analysis revealed over 10 thousand small-size assembly errors in the current C. briggsae reference genome that can be readily corrected. The revised C. briggsae genome annotation represents a solid platform for comparative genomics analysis and evolutionary studies of Caenorhabditis species.


Asunto(s)
Caenorhabditis/genética , Genoma de los Helmintos , Anotación de Secuencia Molecular/métodos , Análisis de Secuencia de ARN/métodos , Transcriptoma , Empalme Alternativo , Animales , Secuencia de Bases , Secuencia Conservada , Evolución Molecular , Perfilación de la Expresión Génica/métodos , Intrones , Modelos Genéticos , Operón , Sitios de Empalme de ARN , ARN Lider Empalmado/genética , ARN Lider Empalmado/metabolismo , Alineación de Secuencia/métodos , Sintenía , Trans-Empalme
4.
BMC Genomics ; 15: 255, 2014 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-24694239

RESUMEN

BACKGROUND: Increasing genetic and phenotypic differences found among natural isolates of C. elegans have encouraged researchers to explore the natural variation of this nematode species. RESULTS: Here we report on the identification of genomic differences between the reference strain N2 and the Hawaiian strain CB4856, one of the most genetically distant strains from N2. To identify both small- and large-scale genomic variations (GVs), we have sequenced the CB4856 genome using both Roche 454 (~400 bps single reads) and Illumina GA DNA sequencing methods (101 bps paired-end reads). Compared to previously described variants (available in WormBase), our effort uncovered twice as many single nucleotide variants (SNVs) and increased the number of small InDels almost 20-fold. Moreover, we identified and validated large insertions, most of which range from 150 bps to 1.2 kb in length in the CB4856 strain. Identified GVs had a widespread impact on protein-coding sequences, including 585 single-copy genes that have associated severe phenotypes of reduced viability in RNAi and genetics studies. Sixty of these genes are homologs of human genes associated with diseases. Furthermore, our work confirms previously identified GVs associated with differences in behavioural and biological traits between the N2 and CB4856 strains. CONCLUSIONS: The identified GVs provide a rich resource for future studies that aim to explain the genetic basis for other trait differences between the N2 and CB4856 strains.


Asunto(s)
Caenorhabditis elegans/genética , Variación Genética , Genoma de los Helmintos , Animales , Composición de Base , Caenorhabditis elegans/efectos de los fármacos , Mapeo Cromosómico , Codón , Hibridación Genómica Comparativa , Biología Computacional , Elementos Transponibles de ADN , Resistencia a Medicamentos/genética , Estudios de Asociación Genética , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación INDEL , Familia de Multigenes , Mutagénesis Insercional , Sistemas de Lectura Abierta , Fenotipo , Polimorfismo de Nucleótido Simple , Eliminación de Secuencia
5.
BMC Evol Biol ; 10: 130, 2010 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-20441589

RESUMEN

BACKGROUND: Intraflagellar transport (IFT) genes, which are critical for the development and function of cilia and flagella in metazoans, are tightly regulated by the Regulatory Factor X (RFX) transcription factors (TFs). However, how and when their evolutionary relationship was established remains unknown. RESULTS: We have identified evidence suggesting that RFX TFs and IFT genes evolved independently and their evolution converged before the first appearance of metazoans. Both ciliary genes and RFX TFs exist in all metazoans as well as some unicellular eukaryotes. However, while RFX TFs and IFT genes are found simultaneously in all sequenced metazoan genomes, RFX TFs do not co-exist with IFT genes in most pre-metazoans and thus do not regulate them in these organisms. For example, neither the budding yeast nor the fission yeast possesses cilia although both have well-defined RFX TFs. Conversely, most unicellular eukaryotes, including the green alga Chlamydomonas reinhardtii, have typical cilia and well conserved IFT genes but lack RFX TFs. Outside of metazoans, RFX TFs and IFT genes co-exist only in choanoflagellates including M. brevicollis, and only one fungus Allomyces macrogynus of the 51 sequenced fungus genomes. M. brevicollis has two putative RFX genes and a full complement of ciliary genes. CONCLUSIONS: The evolution of RFX TFs and IFT genes were independent in pre-metazoans. We propose that their convergence in evolution, or the acquired transcriptional regulation of IFT genes by RFX TFs, played a pivotal role in the establishment of metazoan.


Asunto(s)
Chlamydomonas reinhardtii/genética , Cilios/metabolismo , Proteínas de Unión al ADN/genética , Evolución Molecular , Proteínas Protozoarias/genética , Factores de Transcripción/genética , Secuencia de Aminoácidos , Animales , Chlamydomonas reinhardtii/metabolismo , Proteínas de Unión al ADN/química , Humanos , Proteínas Protozoarias/química , Factores de Transcripción del Factor Regulador X , Alineación de Secuencia , Factores de Transcripción/química
6.
BMC Cell Biol ; 11: 71, 2010 Sep 21.
Artículo en Inglés | MEDLINE | ID: mdl-20858267

RESUMEN

BACKGROUND: The spindle assembly checkpoint (SAC) delays anaphase onset by inhibiting the activity of the anaphase promoting complex/cyclosome (APC/C) until all of the kinetochores have properly attached to the spindle. The importance of SAC genes for genome stability is well established; however, the roles these genes play, during postembryonic development of a multicellular organism, remain largely unexplored. RESULTS: We have used GFP fusions of 5' upstream intergenic regulatory sequences to assay spatiotemporal expression patterns of eight conserved genes implicated in the spindle assembly checkpoint function in Caenorhabditis elegans. We have shown that regulatory sequences for all of the SAC genes drive ubiquitous GFP expression during early embryonic development. However, postembryonic spatial analysis revealed distinct, tissue-specific expression of SAC genes with striking co-expression in seam cells, as well as in the gut. Additionally, we show that the absence of MDF-2/Mad2 (one of the checkpoint genes) leads to aberrant number and alignment of seam cell nuclei, defects mainly attributed to abnormal postembryonic cell proliferation. Furthermore, we show that these defects are completely rescued by fzy-1(h1983)/CDC20, suggesting that regulation of the APC/CCDC20 by the SAC component MDF-2 is important for proper postembryonic cell proliferation. CONCLUSION: Our results indicate that SAC genes display different tissue-specific expression patterns during postembryonic development in C. elegans with significant co-expression in hypodermal seam cells and gut cells, suggesting that these genes have distinct as well as overlapping roles in postembryonic development that may or may not be related to their established roles in mitosis. Furthermore, we provide evidence, by monitoring seam cell lineage, that one of the checkpoint genes is required for proper postembryonic cell proliferation. Importantly, our research provides the first evidence that postembryonic cell division is more sensitive to SAC loss, in particular MDF-2 loss, than embryonic cell division.


Asunto(s)
Proteínas de Caenorhabditis elegans/metabolismo , Proteínas de Ciclo Celular/metabolismo , Genes cdc/fisiología , Huso Acromático/metabolismo , Regiones no Traducidas 5'/genética , Animales , Caenorhabditis elegans/embriología , Caenorhabditis elegans/crecimiento & desarrollo , Proteínas de Caenorhabditis elegans/genética , Proteínas Cdc20 , Proteínas de Ciclo Celular/genética , Proliferación Celular , Desarrollo Embrionario/genética , Perfilación de la Expresión Génica , Ingeniería Genética , Proteínas Fluorescentes Verdes/genética , Proteínas Fluorescentes Verdes/metabolismo , Morfogénesis/genética , Mutación/genética , Unión Proteica , Huso Acromático/genética
7.
G3 (Bethesda) ; 6(1): 133-40, 2015 Nov 12.
Artículo en Inglés | MEDLINE | ID: mdl-26564951

RESUMEN

Cancer therapy largely depends on chemotherapeutic agents that generate DNA lesions. However, our understanding of the nature of the resulting lesions as well as the mutational profiles of these chemotherapeutic agents is limited. Among these lesions, DNA interstrand crosslinks are among the more toxic types of DNA damage. Here, we have characterized the mutational spectrum of the commonly used DNA interstrand crosslinking agent mitomycin C (MMC). Using a combination of genetic mapping, whole genome sequencing, and genomic analysis, we have identified and confirmed several genomic lesions linked to MMC-induced DNA damage in Caenorhabditis elegans. Our data indicate that MMC predominantly causes deletions, with a 5'-CpG-3' sequence context prevalent in the deleted regions of DNA. Furthermore, we identified microhomology flanking the deletion junctions, indicative of DNA repair via nonhomologous end joining. Based on these results, we propose a general repair mechanism that is likely to be involved in the biological response to this highly toxic agent. In conclusion, the systematic study we have described provides insight into potential sequence specificity of MMC with DNA.


Asunto(s)
Antineoplásicos/toxicidad , Caenorhabditis elegans/efectos de los fármacos , Caenorhabditis elegans/genética , Genoma , Genómica , Mitomicina/toxicidad , Mutación/efectos de los fármacos , Animales , Mapeo Cromosómico , Biología Computacional/métodos , Daño del ADN/efectos de los fármacos , Genes Letales , Genómica/métodos , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Mutagénesis Insercional , Mutágenos/farmacología , Tasa de Mutación , Polimorfismo de Nucleótido Simple , Eliminación de Secuencia
8.
Nat Commun ; 5: 5696, 2014 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-25526346

RESUMEN

Intrahepatic cholangiocarcinoma (ICC) is a fatal primary liver cancer (PLC) that affects 5-10% of all PLCs. Here we sequence tumour and matching control sample pairs of a large cohort of 103 ICC patients in China, resulting in the identification of an ICC-specific somatic mutational signature that is associated with liver inflammation, fibrosis and cirrhosis. We further uncover 25 significantly mutated genes including eight potential driver genes (TP53, KRAS, IDH1, PTEN, ARID1A, EPPK1, ECE2 and FYN). We find that TP53-defective ICC patients are more likely to be HBsAg-seropositive, whereas mutations in the oncogene KRAS are nearly exclusively found in HBsAg-seronegative ICC patients. Three pathways (Ras/phosphatidylinositol-4,5-bisphosphate 3-kinase signalling, p53/cell cycle signalling and transforming growth factor-ß/Smad signalling), genes important for epigenetic regulation and oxidative phosphorylation are substantially affected in ICC. We reveal mutations in this study that may be valuable for designing further studies, better diagnosis and effective therapies.


Asunto(s)
Neoplasias de los Conductos Biliares/genética , Colangiocarcinoma/genética , Neoplasias Hepáticas/genética , Mutación , Adulto , Anciano , Conductos Biliares Intrahepáticos , China , Estudios de Cohortes , Femenino , Regulación Neoplásica de la Expresión Génica , Humanos , Isocitrato Deshidrogenasa/genética , Masculino , Persona de Mediana Edad , Fosfohidrolasa PTEN/genética , Proteínas Proto-Oncogénicas/genética , Proteínas Proto-Oncogénicas p21(ras) , Proteína p53 Supresora de Tumor/genética , Proteínas ras/genética
9.
Genetics ; 190(4): 1225-33, 2012 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-22267497

RESUMEN

The issue of heterozygosity continues to be a challenge in the analysis of genome sequences. In this article, we describe the use of allele ratios to distinguish biologically significant single-nucleotide variants from background noise. An application of this approach is the identification of lethal mutations in Caenorhabditis elegans essential genes, which must be maintained by the presence of a wild-type allele on a balancer. The h448 allele of let-504 is rescued by the duplication balancer sDp2. We readily identified the extent of the duplication when the percentage of read support for the lesion was between 70 and 80%. Examination of the EMS-induced changes throughout the genome revealed that these mutations exist in contiguous blocks. During early embryonic division in self-fertilizing C. elegans, alkylated guanines pair with thymines. As a result, EMS-induced changes become fixed as either G→A or C→T changes along the length of the chromosome. Thus, examination of the distribution of EMS-induced changes revealed the mutational and recombinational history of the chromosome, even generations later. We identified the mutational change responsible for the h448 mutation and sequenced PCR products for an additional four alleles, correlating let-504 with the DNA-coding region for an ortholog of a NFκB-activating protein, NKAP. Our results confirm that whole-genome sequencing is an efficient and inexpensive way of identifying nucleotide alterations responsible for lethal phenotypes and can be applied on a large scale to identify the molecular basis of essential genes.


Asunto(s)
Alelos , Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , Frecuencia de los Genes , Genoma de los Helmintos , Mutación , Animales , Secuencia de Bases , Caenorhabditis elegans/efectos de los fármacos , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Mapeo Cromosómico , Cromosomas/efectos de los fármacos , Cromosomas/genética , Cromosomas/metabolismo , Cruzamientos Genéticos , Intercambio Genético , Duplicación de Gen , Genes Letales , Prueba de Complementación Genética , Organismos Hermafroditas/genética , Organismos Hermafroditas/metabolismo , Heterocigoto , Masculino , Mesilatos/efectos adversos , Mesilatos/farmacología , Organismos Modificados Genéticamente/genética , Organismos Modificados Genéticamente/metabolismo , Polimorfismo de Nucleótido Simple
10.
Genome Res ; 19(1): 143-9, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18838612

RESUMEN

BLAST is an extensively used local similarity search tool for identifying homologous sequences. When a gene sequence (either protein sequence or nucleotide sequence) is used as a query to search for homologous sequences in a genome, the search results, represented as a list of high-scoring pairs (HSPs), are fragments of candidate genes rather than full-length candidate genes. Relevant HSPs ("signals"), which represent candidate genes in the target genome sequences, are buried within a report that contains also hundreds to thousands of random HSPs ("noises"). Consequently, BLAST results are often overwhelming and confusing even to experienced users. For effective use of BLAST, a program is needed for extracting relevant HSPs that represent candidate homologous genes from the entire HSP report. To achieve this goal, we have designed a graph-based algorithm, genBlastA, which automatically filters HSPs into well-defined groups, each representing a candidate gene in the target genome. The novelty of genBlastA is an edge length metric that reflects a set of biologically motivated requirements so that each shortest path corresponds to an HSP group representing a homologous gene. We have demonstrated that this novel algorithm is both efficient and accurate for identifying homologous sequences, and that it outperforms existing approaches with similar functionalities.


Asunto(s)
Algoritmos , Alineación de Secuencia/estadística & datos numéricos , Programas Informáticos , Animales , Caenorhabditis elegans/genética , Bases de Datos Genéticas , Genoma de los Helmintos , Genómica/estadística & datos numéricos , Humanos , Homología de Secuencia de Ácido Nucleico
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA