Your browser doesn't support javascript.
loading
VPatho: a deep learning-based two-stage approach for accurate prediction of gain-of-function and loss-of-function variants.
Ge, Fang; Li, Chen; Iqbal, Shahid; Muhammad, Arif; Li, Fuyi; Thafar, Maha A; Yan, Zihao; Worachartcheewan, Apilak; Xu, Xiaofeng; Song, Jiangning; Yu, Dong-Jun.
Afiliación
  • Ge F; School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China.
  • Li C; School of Computer Science and Information Engineering, Bengbu University, 1866 Caoshan Road, Bengbu, 233030, China.
  • Iqbal S; Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
  • Muhammad A; Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
  • Li F; Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia.
  • Thafar MA; Department of Community Medical Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
  • Yan Z; Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Victoria, Australia.
  • Worachartcheewan A; College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Xu X; Department of Computer Science, College of Computers and Information Technology, Taif University, P.O.Box 110099, Taif 21944, Saudi Arabia.
  • Song J; School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China.
  • Yu DJ; Department of Community Medical Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
Brief Bioinform ; 24(1)2023 01 19.
Article en En | MEDLINE | ID: mdl-36528806
ABSTRACT
Determining the pathogenicity and functional impact (i.e. gain-of-function; GOF or loss-of-function; LOF) of a variant is vital for unraveling the genetic level mechanisms of human diseases. To provide a 'one-stop' framework for the accurate identification of pathogenicity and functional impact of variants, we developed a two-stage deep-learning-based computational solution, termed VPatho, which was trained using a total of 9619 pathogenic GOF/LOF and 138 026 neutral variants curated from various databases. A total number of 138 variant-level, 262 protein-level and 103 genome-level features were extracted for constructing the models of VPatho. The development of VPatho consists of two stages (i) a random under-sampling multi-scale residual neural network (ResNet) with a newly defined weighted-loss function (RUS-Wg-MSResNet) was proposed to predict variants' pathogenicity on the gnomAD_NV + GOF/LOF dataset; and (ii) an XGBOD model was constructed to predict the functional impact of the given variants. Benchmarking experiments demonstrated that RUS-Wg-MSResNet achieved the highest prediction performance with the weights calculated based on the ratios of neutral versus pathogenic variants. Independent tests showed that both RUS-Wg-MSResNet and XGBOD achieved outstanding performance. Moreover, assessed using variants from the CAGI6 competition, RUS-Wg-MSResNet achieved superior performance compared to state-of-the-art predictors. The fine-trained XGBOD models were further used to blind test the whole LOF data downloaded from gnomAD and accordingly, we identified 31 nonLOF variants that were previously labeled as LOF/uncertain variants. As an implementation of the developed approach, a webserver of VPatho is made publicly available at http//csbio.njust.edu.cn/bioinf/vpatho/ to facilitate community-wide efforts for profiling and prioritizing the query variants with respect to their pathogenicity and functional impact.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China