Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
BMC Bioinformatics ; 24(1): 133, 2023 Apr 04.
Artículo en Inglés | MEDLINE | ID: mdl-37016291

RESUMEN

BACKGROUND: RNA-seq followed by de novo transcriptome assembly has been a transformative technique in biological research of non-model organisms, but the computational processing of RNA-seq data entails many different software tools. The complexity of these de novo transcriptomics workflows therefore presents a major barrier for researchers to adopt best-practice methods and up-to-date versions of software. RESULTS: Here we present a streamlined and universal de novo transcriptome assembly and annotation pipeline, transXpress, implemented in Snakemake. transXpress supports two popular assembly programs, Trinity and rnaSPAdes, and allows parallel execution on heterogeneous cluster computing hardware. CONCLUSIONS: transXpress simplifies the use of best-practice methods and up-to-date software for de novo transcriptome assembly, and produces standardized output files that can be mined using SequenceServer to facilitate rapid discovery of new genes and proteins in non-model organisms.


Asunto(s)
Programas Informáticos , Transcriptoma , Análisis de Secuencia de ARN/métodos , RNA-Seq , Perfilación de la Expresión Génica , Anotación de Secuencia Molecular
2.
Mol Biol Evol ; 38(12): 5255-5274, 2021 12 09.
Artículo en Inglés | MEDLINE | ID: mdl-34410426

RESUMEN

Hybridization and genome duplication have played crucial roles in the evolution of many animal and plant taxa. The subgenomes of parental species undergo considerable changes in hybrids and polyploids, which often selectively eliminate segments of one subgenome. However, the mechanisms underlying these changes are not well understood, particularly when the hybridization is linked with asexual reproduction that opens up unexpected evolutionary pathways. To elucidate this problem, we compared published cytogenetic and RNAseq data with exome sequences of asexual diploid and polyploid hybrids between three fish species; Cobitis elongatoides, C. taenia, and C. tanaitica. Clonal genomes remained generally static at chromosome-scale levels but their heterozygosity gradually deteriorated at the level of individual genes owing to allelic deletions and conversions. Interestingly, the impact of both processes varies among animals and genomic regions depending on ploidy level and the properties of affected genes. Namely, polyploids were more tolerant to deletions than diploid asexuals where conversions prevailed, and genomic restructuring events accumulated preferentially in genes characterized by high transcription levels and GC-content, strong purifying selection and specific functions like interacting with intracellular membranes. Although hybrids were phenotypically more similar to C. taenia, we found that they preferentially retained C. elongatoides alleles. This demonstrates that favored subgenome is not necessarily the transcriptionally dominant one. This study demonstrated that subgenomes in asexual hybrids and polyploids evolve under a complex interplay of selection and several molecular mechanisms whose efficiency depends on the organism's ploidy level, as well as functional properties and parental ancestry of the genomic region.


Asunto(s)
Cipriniformes , Poliploidía , Animales , Cipriniformes/genética , Diploidia , Evolución Molecular , Genoma de Planta , Hibridación Genética , Pérdida de Heterocigocidad
3.
Mutagenesis ; 36(5): 358-368, 2021 10 06.
Artículo en Inglés | MEDLINE | ID: mdl-34214148

RESUMEN

Cell-free DNA (cfDNA) has recently been used as a non-invasive diagnostic tool for detecting tumour-specific mutations. cfDNA may also be used for monitoring disease progression and treatment response, but so far researchers focused on one or few genes only. A genomic profile may provide better information on patient prognosis compared to single specific mutations. In this hypothesis-generating study, we profiled by whole exome sequencing serial plasma samples from 10 colon cancer (CC) patients collected before and after 5-fluorouracil-based therapy, and one year after diagnosis to determine alterations associated with treatment response. In parallel, genome profiling was also performed in patients' corresponding tumour tissue to ascertain the molecular landscape of resistant tumours. The mutation concordance between cfDNA and tumour tissue DNA was higher in more advanced tumour stages than in the early stages of the disease. In non-responders, a specific mutation profile was observed in tumour tissues (TPSD1 p.Ala92Thr, CPAMD8 p.Arg341Gln, OBP2A p.ArgTyr123CysHis). A pathogenic APC mutation (p.Ser1315Ter) was detected only in cfDNA of one poor responder one year after the diagnosis and after therapy termination. Another poor responder presented a likely pathogenic TP53 mutation (p.Arg110Pro) in cfDNA of all plasma samplings and in tumour tissue. In conclusion, cfDNA could be used for genetic characterisation of CC patients and might be clinically useful for non-invasive therapy response monitoring.


Asunto(s)
Biomarcadores de Tumor , Ácidos Nucleicos Libres de Células , Neoplasias del Colon/diagnóstico , Neoplasias del Colon/genética , ADN de Neoplasias , Mutación , Anciano , Neoplasias del Colon/sangre , Neoplasias del Colon/terapia , Femenino , Fluorouracilo/farmacología , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Persona de Mediana Edad , Clasificación del Tumor , Estadificación de Neoplasias , Pronóstico , Análisis de Secuencia de ADN
4.
Parasitology ; 144(7): 975-983, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28241893

RESUMEN

Self/non-self-discrimination by vertebrate immune systems is based on the recognition of the presence of peptides in proteins of a parasite that are not contained in the proteins of a host. Therefore, a reduction of the number of 'words' in its own peptide vocabulary could be an efficient evolutionary strategy of parasites for escaping recognition. Here, we compared peptide vocabularies of 30 endoparasitic and 17 free-living unicellular organisms and also eight multicellular parasitic and 16 multicellular free-living organisms. We found that both unicellular and multicellular parasites used a significantly lower number of different pentapeptides than free-living controls. Impoverished pentapeptide vocabularies in parasites were observed across all five clades that contain both the parasitic and free-living species. The effect of parasitism on a number of peptides used in an organism's proteins is larger than effects of all other studied factors, including the size of a proteome, the number of encoded proteins, etc. This decrease of pentapeptide diversity was partly compensated for by an increased number of hexapeptides. Our results support the hypothesis of parasitism-associated reduction of peptide vocabulary and suggest that T-cell receptors mostly recognize the five amino acids-long part of peptides that are presented in the groove of major histocompatibility complex molecules.


Asunto(s)
Evolución Biológica , Proteínas del Helminto/análisis , Interacciones Huésped-Parásitos , Péptidos/análisis , Proteoma , Proteínas Protozoarias/análisis , Animales
6.
Nucleic Acids Res ; 38(Database issue): D131-6, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19917642

RESUMEN

The IRESite (http://www.iresite.org) presents carefully curated experimental evidence of many eukaryotic viral and cellular internal ribosome entry site (IRES) regions. At the time of submission, IRESite stored >600 records. The IRESite gradually evolved into a robust tool providing (i) biologically meaningful information regarding the IRESs and their experimental background (including annotation of IRES secondary structures and IRES trans-acting factors) as well as (ii) thorough concluding remarks to stored database entries and regularly updated evaluation of the reported IRES function. A substantial portion of the IRESite data results purely from in-house bioinformatic analyses of currently available sequences, in silico attempts to repeat published cloning experiments, DNA sequencing and restriction endonuclease verification of received plasmid DNA. We also present a newly implemented tool for displaying RNA secondary structures and for searching through the structures currently stored in the database. The supplementary material contains an updated list of reported IRESs.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , ARN Viral/genética , Ribosomas/genética , Animales , Biología Computacional/tendencias , Genoma Viral , Humanos , Almacenamiento y Recuperación de la Información/métodos , Internet , Conformación de Ácido Nucleico , Iniciación de la Cadena Peptídica Traduccional/genética , Plásmidos/metabolismo , Análisis de Secuencia de ADN , Programas Informáticos
7.
Proc Natl Acad Sci U S A ; 106(6): 1954-9, 2009 Feb 10.
Artículo en Inglés | MEDLINE | ID: mdl-19193860

RESUMEN

Brown-rot fungi such as Postia placenta are common inhabitants of forest ecosystems and are also largely responsible for the destructive decay of wooden structures. Rapid depolymerization of cellulose is a distinguishing feature of brown-rot, but the biochemical mechanisms and underlying genetics are poorly understood. Systematic examination of the P. placenta genome, transcriptome, and secretome revealed unique extracellular enzyme systems, including an unusual repertoire of extracellular glycoside hydrolases. Genes encoding exocellobiohydrolases and cellulose-binding domains, typical of cellulolytic microbes, are absent in this efficient cellulose-degrading fungus. When P. placenta was grown in medium containing cellulose as sole carbon source, transcripts corresponding to many hemicellulases and to a single putative beta-1-4 endoglucanase were expressed at high levels relative to glucose-grown cultures. These transcript profiles were confirmed by direct identification of peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS). Also up-regulated during growth on cellulose medium were putative iron reductases, quinone reductase, and structurally divergent oxidases potentially involved in extracellular generation of Fe(II) and H(2)O(2). These observations are consistent with a biodegradative role for Fenton chemistry in which Fe(II) and H(2)O(2) react to form hydroxyl radicals, highly reactive oxidants capable of depolymerizing cellulose. The P. placenta genome resources provide unparalleled opportunities for investigating such unusual mechanisms of cellulose conversion. More broadly, the genome offers insight into the diversification of lignocellulose degrading mechanisms in fungi. Comparisons with the closely related white-rot fungus Phanerochaete chrysosporium support an evolutionary shift from white-rot to brown-rot during which the capacity for efficient depolymerization of lignin was lost.


Asunto(s)
Perfilación de la Expresión Génica , Genoma Fúngico , Lignina/metabolismo , Redes y Vías Metabólicas/genética , Polyporales/genética , Secuencia de Bases , Evolución Biológica , Celulasas , Enzimas/genética , Glicósido Hidrolasas , Datos de Secuencia Molecular , Oxidorreductasas , Polyporales/metabolismo , Madera/metabolismo
8.
RNA ; 14(9): 1720-9, 2008 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-18697919

RESUMEN

A firefly luciferase (FLuc) counts among the most popular reporters of present-day molecular and cellular biology. In this study, we report a cryptic promoter activity in the luc+ gene, which is the most frequently used version of the firefly luciferase. The FLuc coding region displays cryptic promoter activity both in mammalian and yeast cells. In human CCL13 and Huh7 cells, cryptic transcription from the luc+ gene is 10-16 times weaker in comparison to the strong immediate-early cytomegalovirus promoter. Additionally, we discuss a possible impact of the FLuc gene cryptic promoter on experimental results especially in some fields of the RNA-oriented research, for example, in analysis of translation initiation or analysis of miRNA/siRNA function. Specifically, we propose how this newly described cryptic promoter activity within the FLuc gene might contribute to the previous determination of the strength of the cryptic promoter found in the cDNA corresponding to the hepatitis C virus internal ribosome entry site. Our findings should appeal to the researchers to be more careful when designing firefly luciferase-based assays as well as open the possibility of performing some experiments with the hepatitis C virus internal ribosome entry site, which could not be considered until now.


Asunto(s)
Regulación de la Expresión Génica , Luciferasas de Luciérnaga/genética , Regiones Promotoras Genéticas , Transcripción Genética , Animales , Línea Celular , ADN Complementario/genética , Hepacivirus/genética , Humanos
9.
Nucleic Acids Res ; 34(Database issue): D125-30, 2006 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-16381829

RESUMEN

IRESite is an exhaustive, manually annotated non-redundant relational database focused on the IRES elements (Internal Ribosome Entry Site) and containing information not available in the primary public databases. IRES elements were originally found in eukaryotic viruses hijacking initiation of translation of their host. Later on, they were also discovered in 5'-untranslated regions of some eukaryotic mRNA molecules. Currently, IRESite presents up to 92 biologically relevant aspects of every experiment, e.g. the nature of an IRES element, its functionality/defectivity, origin, size, sequence, structure, its relative position with respect to surrounding protein coding regions, positive/negative controls used in the experiment, the reporter genes used to monitor IRES activity, the measured reporter protein yields/activities, and references to original publications as well as cross-references to other databases, and also comments from submitters and our curators. Furthermore, the site presents the known similarities to rRNA sequences as well as RNA-protein interactions. Special care is given to the annotation of promoter-like regions. The annotated data in IRESite are bound to mostly complete, full-length mRNA, and whenever possible, accompanied by original plasmid vector sequences. New data can be submitted through the publicly available web-based interface at http://www.iresite.org and are curated by a team of lab-experienced biologists.


Asunto(s)
Regiones no Traducidas 5'/química , Bases de Datos de Ácidos Nucleicos , Iniciación de la Cadena Peptídica Traduccional , Internet , Factores de Iniciación de Péptidos/metabolismo , Plásmidos/química , Regiones Promotoras Genéticas , ARN Mensajero/química , ARN Viral/química , Secuencias Reguladoras de Ácido Ribonucleico , Interfaz Usuario-Computador
10.
Nucleic Acids Res ; 32(18): 5539-45, 2004.
Artículo en Inglés | MEDLINE | ID: mdl-15486203

RESUMEN

In this paper, we present the Functional Catalogue (FunCat), a hierarchically structured, organism-independent, flexible and scalable controlled classification system enabling the functional description of proteins from any organism. FunCat has been applied for the manual annotation of prokaryotes, fungi, plants and animals. We describe how FunCat is implemented as a highly efficient and robust tool for the manual and automatic annotation of genomic sequences. Owing to its hierarchical architecture, FunCat has also proved to be useful for many subsequent downstream bioinformatic applications. This is illustrated by the analysis of large-scale experiments from various investigations in transcriptomics and proteomics, where FunCat was used to project experimental data into functional units, as 'gold standard' for functional classification methods, and also served to compare the significance of different experimental methods. Over the last decade, the FunCat has been established as a robust and stable annotation scheme that offers both, meaningful and manageable functional classification as well as ease of perception.


Asunto(s)
Biología Computacional/métodos , Genoma , Proteínas/clasificación , Proteínas/metabolismo , Proteómica/métodos , Programas Informáticos , Indización y Redacción de Resúmenes , Animales , Automatización/instrumentación , Automatización/métodos , Biología Computacional/instrumentación , Genómica/instrumentación , Genómica/métodos , Internet , Unión Proteica , Proteínas/genética , Proteoma/clasificación , Proteoma/genética , Proteoma/metabolismo , Proteómica/instrumentación , Reproducibilidad de los Resultados , Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/clasificación , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Terminología como Asunto , Transcripción Genética/genética
11.
Nucleic Acids Res ; 31(1): 207-11, 2003 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-12519983

RESUMEN

The PEDANT genome database (http://pedant.gsf.de) provides exhaustive automatic analysis of genomic sequences by a large variety of established bioinformatics tools through a comprehensive Web-based user interface. One hundred and seventy seven completely sequenced and unfinished genomes have been processed so far, including large eukaryotic genomes (mouse, human) published recently. In this contribution, we describe the current status of the PEDANT database and novel analytical features added to the PEDANT server in 2002. Those include: (i) integration with the BioRS data retrieval system which allows fast text queries, (ii) pre-computed sequence clusters in each complete genome, (iii) a comprehensive set of tools for genome comparison, including genome comparison tables and protein function prediction based on genomic context, and (iv) computation and visualization of protein-protein interaction (PPI) networks based on experimental data. The availability of functional and structural predictions for 650 000 genomic proteins in well organized form makes PEDANT a useful resource for both functional and structural genomics.


Asunto(s)
Bases de Datos Genéticas , Genoma , Animales , Biología Computacional , Duplicación de Gen , Genómica , Humanos , Almacenamiento y Recuperación de la Información , Ratones , Mapeo de Interacción de Proteínas , Proteínas/química , Proteínas/fisiología , Proteínas de Saccharomyces cerevisiae/metabolismo , Análisis de Secuencia de Proteína , Homología de Secuencia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA