Your browser doesn't support javascript.
loading
MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.
Zhang, Meng; Li, Fuyi; Marquez-Lago, Tatiana T; Leier, André; Fan, Cunshuo; Kwoh, Chee Keong; Chou, Kuo-Chen; Song, Jiangning; Jia, Cangzhi.
Afiliación
  • Zhang M; School of Science, Dalian Maritime University, Dalian, China.
  • Li F; Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology.
  • Marquez-Lago TT; Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia.
  • Leier A; Department of Genetics, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
  • Fan C; Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
  • Kwoh CK; Department of Genetics, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
  • Chou KC; Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
  • Song J; College of Information Engineering, Northwest A&F University, Yangling, China.
  • Jia C; School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore.
Bioinformatics ; 35(17): 2957-2965, 2019 09 01.
Article en En | MEDLINE | ID: mdl-30649179
MOTIVATION: Promoters are short DNA consensus sequences that are localized proximal to the transcription start sites of genes, allowing transcription initiation of particular genes. However, the precise prediction of promoters remains a challenging task because individual promoters often differ from the consensus at one or more positions. RESULTS: In this study, we present a new multi-layer computational approach, called MULTiPly, for recognizing promoters and their specific types. MULTiPly took into account the sequences themselves, including both local information such as k-tuple nucleotide composition, dinucleotide-based auto covariance and global information of the entire samples based on bi-profile Bayes and k-nearest neighbour feature encodings. Specifically, the F-score feature selection method was applied to identify the best unique type of feature prediction results, in combination with other types of features that were subsequently added to further improve the prediction performance of MULTiPly. Benchmarking experiments on the benchmark dataset and comparisons with five state-of-the-art tools show that MULTiPly can achieve a better prediction performance on 5-fold cross-validation and jackknife tests. Moreover, the superiority of MULTiPly was also validated on a newly constructed independent test dataset. MULTiPly is expected to be used as a useful tool that will facilitate the discovery of both general and specific types of promoters in the post-genomic era. AVAILABILITY AND IMPLEMENTATION: The MULTiPly webserver and curated datasets are freely available at http://flagshipnt.erc.monash.edu/MULTiPly/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Regiones Promotoras Genéticas / Genómica Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Regiones Promotoras Genéticas / Genómica Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: China