Your browser doesn't support javascript.
loading
SemanticCAP: Chromatin Accessibility Prediction Enhanced by Features Learning from a Language Model.
Zhang, Yikang; Chu, Xiaomin; Jiang, Yelu; Wu, Hongjie; Quan, Lijun.
Afiliación
  • Zhang Y; School of Computer Science and Technology, Soochow University, Suzhou 215006, China.
  • Chu X; Jiangsu Province Key Lab for Information Processing Technologies, Soochow University, Suzhou 215006, China.
  • Jiang Y; School of Computer Science and Technology, Soochow University, Suzhou 215006, China.
  • Wu H; School of Computer Science and Technology, Soochow University, Suzhou 215006, China.
  • Quan L; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China.
Genes (Basel) ; 13(4)2022 03 23.
Article en En | MEDLINE | ID: mdl-35456374
A large number of inorganic and organic compounds are able to bind DNA and form complexes, among which drug-related molecules are important. Chromatin accessibility changes not only directly affect drug-DNA interactions, but they can promote or inhibit the expression of the critical genes associated with drug resistance by affecting the DNA binding capacity of TFs and transcriptional regulators. However, the biological experimental techniques for measuring it are expensive and time-consuming. In recent years, several kinds of computational methods have been proposed to identify accessible regions of the genome. Existing computational models mostly ignore the contextual information provided by the bases in gene sequences. To address these issues, we proposed a new solution called SemanticCAP. It introduces a gene language model that models the context of gene sequences and is thus able to provide an effective representation of a certain site in a gene sequence. Basically, we merged the features provided by the gene language model into our chromatin accessibility model. During the process, we designed methods called SFA and SFC to make feature fusion smoother. Compared to DeepSEA, gkm-SVM, and k-mer using public benchmarks, our model proved to have better performance, showing a 1.25% maximum improvement in auROC and a 2.41% maximum improvement in auPRC.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Cromatina / Lenguaje Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Genes (Basel) Año: 2022 Tipo del documento: Article País de afiliación: China Pais de publicación: Suiza

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Cromatina / Lenguaje Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Genes (Basel) Año: 2022 Tipo del documento: Article País de afiliación: China Pais de publicación: Suiza