Your browser doesn't support javascript.
loading
Identification of DNase I hypersensitive sites in the human genome by multiple sequence descriptors.
Jin, Yan-Ting; Tan, Yang; Gan, Zhong-Hua; Hao, Yu-Duo; Wang, Tian-Yu; Lin, Hao; Tang, Bo.
Afiliação
  • Jin YT; School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China. Electronic address: jinyanting@uestc.edu.cn.
  • Tan Y; School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China.
  • Gan ZH; Department of Pathology, The Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, Sichuan, China.
  • Hao YD; School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China. Electronic address: 202221140528@std.uestc.edu.cn.
  • Wang TY; School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China.
  • Lin H; School of Life Science and Technology, University of Electronic Science and Technology of China, 611731 Chengdu, China. Electronic address: hlin@uestc.edu.cn.
  • Tang B; Department of Pathology, The Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, Sichuan, China. Electronic address: 13438585758@163.com.
Methods ; 229: 125-132, 2024 Jul 02.
Article em En | MEDLINE | ID: mdl-38964595
ABSTRACT
DNase I hypersensitive sites (DHSs) are chromatin regions highly sensitive to DNase I enzymes. Studying DHSs is crucial for understanding complex transcriptional regulation mechanisms and localizing cis-regulatory elements (CREs). Numerous studies have indicated that disease-related loci are often enriched in DHSs regions, underscoring the importance of identifying DHSs. Although wet experiments exist for DHSs identification, they are often labor-intensive. Therefore, there is a strong need to develop computational methods for this purpose. In this study, we used experimental data to construct a benchmark dataset. Seven feature extraction methods were employed to capture information about human DHSs. The F-score was applied to filter the features. By comparing the prediction performance of various classification algorithms through five-fold cross-validation, random forest was proposed to perform the final model construction. The model could produce an overall prediction accuracy of 0.859 with an AUC value of 0.837. We hope that this model can assist scholars conducting DNase research in identifying these sites.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article