Your browser doesn't support javascript.
loading
Tissue Expression Difference between mRNAs and lncRNAs.
Chen, Lei; Zhang, Yu-Hang; Pan, Xiaoyong; Liu, Min; Wang, Shaopeng; Huang, Tao; Cai, Yu-Dong.
Afiliação
  • Chen L; School of Life Sciences, Shanghai University, Shanghai 200444, China. chen_lei1@163.com.
  • Zhang YH; College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China. chen_lei1@163.com.
  • Pan X; Shanghai Key Laboratory of PMMP, East China Normal University, Shanghai 200241, China. chen_lei1@163.com.
  • Liu M; Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China. zhangyh825@163.com.
  • Wang S; Department of Medical Informatics, Erasmus MC, 3000 CA Rotterdam, The Netherlands. xypan172436@gmail.com.
  • Huang T; College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China. liumin@shmtu.edu.cn.
  • Cai YD; School of Life Sciences, Shanghai University, Shanghai 200444, China. wsptfb@163.com.
Int J Mol Sci ; 19(11)2018 Oct 31.
Article em En | MEDLINE | ID: mdl-30384456
ABSTRACT
Messenger RNA (mRNA) and long noncoding RNA (lncRNA) are two main subgroups of RNAs participating in transcription regulation. With the development of next generation sequencing, increasing lncRNAs are identified. Many hidden functions of lncRNAs are also revealed. However, the differences in lncRNAs and mRNAs are still unclear. For example, we need to determine whether lncRNAs have stronger tissue specificity than mRNAs and which tissues have more lncRNAs expressed. To investigate such tissue expression difference between mRNAs and lncRNAs, we encoded 9339 lncRNAs and 14,294 mRNAs with 71 expression features, including 69 maximum expression features for 69 types of cells, one feature for the maximum expression in all cells, and one expression specificity feature that was measured as Chao-Shen-corrected Shannon's entropy. With advanced feature selection methods, such as maximum relevance minimum redundancy, incremental feature selection methods, and random forest algorithm, 13 features presented the dissimilarity of lncRNAs and mRNAs. The 11 cell subtype features indicated which cell types of the lncRNAs and mRNAs had the largest expression difference. Such cell subtypes may be the potential cell models for lncRNA identification and function investigation. The expression specificity feature suggested that the cell types to express mRNAs and lncRNAs were different. The maximum expression feature suggested that the maximum expression levels of mRNAs and lncRNAs were different. In addition, the rule learning algorithm, repeated incremental pruning to produce error reduction algorithm, was also employed to produce effective classification rules for classifying lncRNAs and mRNAs, which gave competitive results compared with random forest and could give a clearer picture of different expression patterns between lncRNAs and mRNAs. Results not only revealed the heterogeneous expression pattern of lncRNA and mRNA, but also gave rise to the development of a new tool to identify the potential biological functions of such RNA subgroups.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA Mensageiro / Regulação da Expressão Gênica / Bases de Dados de Ácidos Nucleicos / RNA Longo não Codificante Limite: Animals / Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA Mensageiro / Regulação da Expressão Gênica / Bases de Dados de Ácidos Nucleicos / RNA Longo não Codificante Limite: Animals / Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article