Your browser doesn't support javascript.
loading
Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models.
Pande, Akshara; Patiyal, Sumeet; Lathwal, Anjali; Arora, Chakit; Kaur, Dilraj; Dhall, Anjali; Mishra, Gaurav; Kaur, Harpreet; Sharma, Neelam; Jain, Shipra; Usmani, Salman Sadullah; Agrawal, Piyush; Kumar, Rajesh; Kumar, Vinod; Raghava, Gajendra P S.
Afiliação
  • Pande A; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Patiyal S; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Lathwal A; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Arora C; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Kaur D; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Dhall A; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Mishra G; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Kaur H; Department of Electrical Engineering, Shiv Nadar University, Greater Noida, India.
  • Sharma N; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Jain S; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India.
  • Usmani SS; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Agrawal P; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Kumar R; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
  • Kumar V; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India.
  • Raghava GPS; Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.
J Comput Biol ; 30(2): 204-222, 2023 02.
Article em En | MEDLINE | ID: mdl-36251780
ABSTRACT
In the last three decades, a wide range of protein features have been discovered to annotate a protein. Numerous attempts have been made to integrate these features in a software package/platform so that the user may compute a wide range of features from a single source. To complement the existing methods, we developed a method, Pfeature, for computing a wide range of protein features. Pfeature allows to compute more than 200,000 features required for predicting the overall function of a protein, residue-level annotation of a protein, and function of chemically modified peptides. It has six major modules, namely, composition, binary profiles, evolutionary information, structural features, patterns, and model building. Composition module facilitates to compute most of the existing compositional features, plus novel features. The binary profile of amino acid sequences allows to compute the fraction of each type of residue as well as its position. The evolutionary information module allows to compute evolutionary information of a protein in the form of a position-specific scoring matrix profile generated using Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST); fit for annotation of a protein and its residues. A structural module was developed for computing of structural features/descriptors from a tertiary structure of a protein. These features are suitable to predict the therapeutic potential of a protein containing non-natural or chemically modified residues. The model-building module allows to implement various machine learning techniques for developing classification and regression models as well as feature selection. Pfeature also allows the generation of overlapping patterns and features from a protein. A user-friendly Pfeature is available as a web server python library and stand-alone package.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Proteínas Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Proteínas Idioma: En Ano de publicação: 2023 Tipo de documento: Article