Detection of long non-coding RNA homology, a comparative study on alignment and alignment-free metrics.
Noviello, Teresa M R; Di Liddo, Antonella; Ventola, Giovanna M; Spagnuolo, Antonietta; D'Aniello, Salvatore; Ceccarelli, Michele; Cerulo, Luigi.
Affiliation
  • Noviello TMR; Dep. of Science and Technology, University of Sannio, via Port'Arsa, 11, Benevento, 82100, Italy.
  • Di Liddo A; BioGeM, Institute of Genetic Research "Gaetano Salvatore", Camporeale, Ariano Irpino (AV), 83031, Italy.
  • Ventola GM; Buchmann Institute for Molecular Life Sciences, Goethe University, Max-von-Laue-Straße 13, Frankfurt am Main, 60438, Germany.
  • Spagnuolo A; Genomix4Life S.r.l., Via Salvador Allende, Baronissi (SA), 84081, Italy.
  • D'Aniello S; Dep. of Biology and Evolution of Marine Organisms, Stazione Zoologica "A. Dohrn", Villa Comunale, Napoli, 80121, Italy.
  • Ceccarelli M; Dep. of Biology and Evolution of Marine Organisms, Stazione Zoologica "A. Dohrn", Villa Comunale, Napoli, 80121, Italy.
  • Cerulo L; Dep. of Science and Technology, University of Sannio, via Port'Arsa, 11, Benevento, 82100, Italy.
BMC Bioinformatics ; 19(1): 407, 2018 Nov 06.
Article in En | MEDLINE | ID: mdl-30400819
ABSTRACT

BACKGROUND:

Long non-coding RNAs (lncRNAs) represent a novel class of non-coding RNAs with a crucial role in many biological processes. Identifying long non-coding homologs among different species is essential for investigating such roles in model organisms, as homologous genes tend to retain similar molecular and biological functions. Alignment-based metrics effectively capture the conservation of transcribed coding sequences and hence the homology of protein-coding genes. However, unlike protein-coding genes, long non-coding genes show poor sequence conservation, which makes the identification of their homologs a challenging task.

RESULTS:

In this study we compare alignment-based and alignment-free string similarity metrics and look at promoter regions as a possible source of conserved information. We show that promoter regions encode relevant information for the conservation of long non-coding genes across species and that such information is better captured by alignment-free metrics. We perform a genome-wide test of this hypothesis in human, mouse, and zebrafish.
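
To illustrate what an alignment-free string similarity metric can look like, the following is a minimal sketch that compares two sequences by the cosine similarity of their k-mer count profiles. It is an illustrative assumption, not necessarily one of the metrics evaluated in the study; the function names, the choice of cosine similarity, and the default `k = 3` are all hypothetical.

```python
from collections import Counter
from math import sqrt


def kmer_profile(seq: str, k: int = 3) -> Counter:
    """Count overlapping k-mers in a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two k-mer count vectors (0.0 if either is empty)."""
    dot = sum(count * b[kmer] for kmer, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def kmer_similarity(seq1: str, seq2: str, k: int = 3) -> float:
    """Alignment-free similarity: no positional alignment is computed,
    only the k-mer composition of each sequence is compared."""
    return cosine_similarity(kmer_profile(seq1, k), kmer_profile(seq2, k))
```

Because such a score depends only on k-mer composition, it is insensitive to rearrangements of short motifs, which is one reason alignment-free measures are considered for poorly conserved sequences such as promoter regions.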

CONCLUSIONS:

These results led us to postulate a new hypothesis: unlike protein-coding genes, long non-coding genes tend to preserve their regulatory machinery rather than their transcribed sequence. All datasets, scripts, and the prediction tools adopted in this study are available at https://github.com/bioinformatics-sannio/lncrna-homologs .

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Gene Expression Regulation / Sequence Alignment / Genome / Conserved Sequence / RNA, Long Noncoding Study type: Diagnostic_studies / Prognostic_studies Limits: Animals / Humans Language: En Journal: BMC Bioinformatics Journal subject: MEDICAL INFORMATICS Publication year: 2018 Document type: Article Affiliation country: Italy