Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Mais filtros

Base de dados
Ano de publicação
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Brief Bioinform ; 21(2): 676-686, 2020 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-30815667

RESUMO

A widely used approach in transcriptome analysis is the alignment of short reads to a reference genome. However, owing to the deficiencies of specially designed analytical systems, short reads unmapped to the genome sequence are usually ignored, resulting in the loss of significant biological information and insights. To fill this gap, we present Comprehensive Assembly and Functional annotation of Unmapped RNA-Seq data (CAFU), a Galaxy-based framework that can facilitate the large-scale analysis of unmapped RNA sequencing (RNA-Seq) reads from single- and mixed-species samples. By taking advantage of machine learning techniques, CAFU addresses the issue of accurately identifying the species origin of transcripts assembled using unmapped reads from mixed-species samples. CAFU also represents an innovation in that it provides a comprehensive collection of functions required for transcript confidence evaluation, coding potential calculation, sequence and expression characterization and function annotation. These functions and their dependencies have been integrated into a Galaxy framework that provides access to CAFU via a user-friendly interface, dramatically simplifying complex exploration tasks involving unmapped RNA-Seq reads. CAFU has been validated with RNA-Seq data sets from wheat and Zea mays (maize) samples. CAFU is freely available via GitHub: https://github.com/cma2015/CAFU.


Assuntos
Biologia Computacional/métodos , Análise de Sequência de RNA/métodos , Genes de Plantas , Humanos , RNA Mensageiro/genética , Triticum/genética , Interface Usuário-Computador , Zea mays/genética
2.
Planta ; 247(3): 745-760, 2018 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-29196940

RESUMO

MAIN CONCLUSION: A large-scale bioinformatics analysis revealed the origin and evolution of GT47 gene family, and identified two clades of intron-poor genes with putative functions in drought stress responses and seed development in maize. Glycosyltransferase family 47 (GT47) genes encode ß-galactosyltransferases and ß-glucuronyltransferases that synthesize pectin, xyloglucans and xylan, which are important components of the plant cell wall. In this study, we performed a systematic and large-scale bioinformatics analysis of GT47 gene family using 352 GT47 proteins from 15 species ranging from cyanobacteria to seed plants. The analysis results showed that GT47 family may originate in cyanobacteria and expand along the evolutionary trajectory to moss. Further analysis of 47 GT47 genes in maize revealed that they can divide into five clades with diverse exon-intron structures. Among these five clades, two were mainly composed with intron-poor genes, which may originate in the moss. Gene duplication analysis revealed that the expansion of GT47 gene family in maize was significantly driven from tandem duplication events and segmental duplication events. Significantly, almost all duplicated genes are intron-poor genes. Expression analysis indicated that several intron-poor GT47 genes may be involved in the drought stress response and seed development in maize. This work provides insight into the origin and evolutionary process, expansion mechanisms and expression patterns of GT47 genes, thus facilitating their functional investigations in the future.


Assuntos
Evolução Molecular , Glicosiltransferases/genética , Íntrons/genética , Briófitas/enzimologia , Briófitas/genética , Cianobactérias/enzimologia , Cianobactérias/genética , Desidratação/genética , Regulação da Expressão Gênica de Plantas/genética , Genes de Plantas/genética , Filogenia , Plantas/genética , Alinhamento de Sequência , Zea mays/enzimologia , Zea mays/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA