Your browser doesn't support javascript.
loading
ORI-Deep: improving the accuracy for predicting origin of replication sites by using a blend of features and long short-term memory network.
Shahid, Mahwish; Ilyas, Maham; Hussain, Waqar; Khan, Yaser Daanial.
Afiliação
  • Shahid M; School of Systems and Technologies, University of Management and Technology, Lahore, Pakistan.
  • Ilyas M; University of Management and Technology, Lahore, Pakistan.
  • Hussain W; University of Management and Technology, Lahore, Pakistan.
  • Khan YD; Department of Computer Science, University of Management and Technology, Lahore, Pakistan.
Brief Bioinform ; 23(2)2022 03 10.
Article em En | MEDLINE | ID: mdl-35048955
ABSTRACT
Replication of DNA is an important process for the cell division cycle, gene expression regulation and other biological evolution processes. It also has a crucial role in a living organism's physical growth and structure. Replication of DNA comprises of three stages known as initiation, elongation and termination, whereas the origin of replication sites (ORI) is the location of initiation of the DNA replication process. There exist various methodologies to identify ORIs in the genomic sequences, however, these methods have used either extensive computations for execution, or have limited optimization for the large datasets. Herein, a model called ORI-Deep is proposed to identify ORIs from the multiple cell type genomic sequence benchmark data. An efficient method is proposed using a deep neural network to identify ORIs for four different eukaryotic species. For better representation of data, a feature vector is constructed using statistical moments for the training and testing of data and is further fed to a long short-term memory (LSTM) network. To prove the effectiveness of the proposed model, we applied several validation techniques at different levels to obtain seven accuracy metrics, and the accuracy score for self-consistency, 10-fold cross-validation, jackknife and the independent set test is observed to be 0.977, 0.948, 0.976 and 0.977, respectively. Based on the results, it can be concluded that ORI-Deep can efficiently predict the sites of origin replication in DNA sequence with high accuracy. Webserver for ORI-Deep is available at (https//share.streamlit.io/waqarhusain/orideep/main/app.py), whereas source code is available at (https//github.com/WaqarHusain/OriDeep).
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Origem de Replicação / Memória de Curto Prazo Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Paquistão

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Origem de Replicação / Memória de Curto Prazo Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Paquistão