RESUMO
De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.
Assuntos
Perfilação da Expressão Gênica/métodos , RNA/química , Software , Transcriptoma , Sequência de Bases , Schizosaccharomyces/genética , Proteínas de Schizosaccharomyces pombe/química , Proteínas de Schizosaccharomyces pombe/genética , Análise de Sequência de RNA/métodosRESUMO
Genomics information relating to human body lice is surprisingly scarce, and this has constrained studies of their physiology, immunology and vector biology. To identify novel body louse genes, we used engorged adult lice to generate a cDNA library. Initially, 1152 clones were screened for inserts, edited for removal of vector sequences and base pairs of poor quality, and viewed for splicing variations, gene families and polymorphism. Computational methods identified 506 inferred open reading frames including the first predicted louse defensin. The inferred defensin aligns well with other insect defensins and has highly conserved cysteine residues, as are known for other defensin sequences. Two cysteine and five serine proteinases were categorized according to their inferred catalytic sites. We also discovered seven putative ubiquitin-pathway genes and four iron metabolizing deduced enzymes. Finally, glutathione-S-transferases and cytochrome P450 genes were among the detoxification enzymes found. Results from this first systematic effort to discover human body louse genes should promote further studies in Phthiraptera and lice.