Search | Secretaria de Estado da Saúde

1.

Drug repurposing for COVID-19 using computational screening: Is Fostamatinib/R406 a potential candidate?

Saha, Sovan; Halder, Anup Kumar; Bandyopadhyay, Soumyendu Sekhar; Chatterjee, Piyali; Nasipuri, Mita; Bose, Debdas; Basu, Subhadip.

Methods ; 203: 564-574, 2022 Jul.

Article in English | MEDLINE | ID: mdl-34455072

ABSTRACT

With the gradual increase in the COVID-19 mortality rate, there is an urgent need for an effective drug or vaccine. Several drugs, such as Remdesivir, Azithromycin, Favipiravir, Ritonavir, and Darunavir, are under evaluation in more than 300 clinical trials to treat COVID-19. Several vaccines, such as Pfizer-BioNTech, Moderna, Johnson & Johnson's Janssen, Sputnik V, Covishield, and Covaxin, have also emerged from research. While a few of them have already been approved, others show encouraging results and are still under assessment. In parallel, there have been significant advances in new drug development. However, since the approval of new molecules takes substantial time, drug repurposing studies have also gained considerable momentum. The primary agent of COVID-19 disease progression is SARS-CoV2/nCoV, which is believed to have ~89% genetic resemblance to SARS-CoV, the coronavirus responsible for the massive outbreak in 2003. With this hypothesis, Human-SARS-CoV protein interactions are used to develop an in-silico Human-nCoV network by identifying potential COVID-19 human spreader proteins with the SIS model and fuzzy thresholding, followed by a validation based on the targets of possible COVID-19 FDA drugs. At first, the complete list of FDA drugs is identified for the level-1 and level-2 spreader proteins in this network, followed by applying a drug consensus scoring strategy. The same consensus strategy is applied in the second analysis, but on a curated overlapping set of key genes/proteins identified from COVID-19 symptoms. Validation through a subsequent docking study has also been performed on the potential COVID-19 drugs against the available major COVID-19 crystal structures, whose PDB IDs are 6LU7, 6M2Q, 6W9C, 6M0J, 6M71 and 6VXX. Our computational study and docking results suggest that Fostamatinib (R406 as its active promoiety) may also be considered one of the potential candidates for further clinical trials in the effort to counter the spread of COVID-19.

Subjects

COVID-19 Drug Treatment , Drug Repositioning , Aminopyridines , Antiviral Agents/pharmacology , Antiviral Agents/therapeutic use , ChAdOx1 nCoV-19 , Drug Repositioning/methods , Humans , Molecular Docking Simulation , Morpholines , Pyrimidines , Viral RNA , SARS-CoV-2

2.

Computational modeling of human-nCoV protein-protein interaction network.

Saha, Sovan; Halder, Anup Kumar; Bandyopadhyay, Soumyendu Sekhar; Chatterjee, Piyali; Nasipuri, Mita; Basu, Subhadip.

Methods ; 203: 488-497, 2022 Jul.

Article in English | MEDLINE | ID: mdl-34902553

ABSTRACT

Novel coronavirus (SARS-CoV2) replicates the host cell's genome by interacting with the host proteins. Due to this fact, the identification of virus and host protein-protein interactions could be beneficial in understanding the disease transmission behavior of the virus as well as in potential COVID-19 drug identification. International Committee on Taxonomy of Viruses (ICTV) has declared that nCoV is highly genetically similar to the SARS-CoV epidemic in 2003 (~89% similarity). With this hypothesis, the present work focuses on developing a computational model for the nCoV-Human protein interaction network, using the experimentally validated SARS-CoV-Human protein interactions. Initially, level-1 and level-2 human spreader proteins are identified in the SARS-CoV-Human interaction network, using Susceptible-Infected-Susceptible (SIS) model. These proteins are considered potential human targets for nCoV bait proteins. A gene-ontology-based fuzzy affinity function has been used to construct the nCoV-Human protein interaction network at a ~99.98% specificity threshold. This also identifies 37 level-1 human spreaders for COVID-19 in the human protein-interaction network. 2474 level-2 human spreaders are subsequently identified using the SIS model. The derived host-pathogen interaction network is finally validated using six potential FDA-listed drugs for COVID-19 with significant overlap between the known drug target proteins and the identified spreader proteins.

Subjects

COVID-19 , SARS-CoV-2 , Computer Simulation , Humans , Protein Interaction Maps/genetics , Proteins , Viral RNA , SARS-CoV-2/genetics
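
The abstract above relies on a Susceptible-Infected-Susceptible (SIS) model to find spreader proteins. The record does not give the authors' parameters or update rule, so the following is only a minimal discrete-time sketch of a generic SIS process on an undirected network. The function name `simulate_sis`, the rates `beta` and `gamma`, and the toy graph are all assumptions made for illustration.

```python
import random


def simulate_sis(adjacency, seeds, beta, gamma, steps, rng):
    """Generic discrete-time SIS process (illustrative, not the paper's code).

    adjacency: dict mapping each node to the set of its neighbours.
    seeds:     nodes infected at time zero.
    beta:      per-contact infection probability (assumed parameter).
    gamma:     per-step recovery probability (assumed parameter).
    Returns how many steps each node spent in the infected state.
    """
    infected = set(seeds)
    steps_infected = {node: 0 for node in adjacency}
    for _ in range(steps):
        next_infected = set()
        for node in adjacency:
            if node in infected:
                # An infected node recovers (turns susceptible again)
                # with probability gamma, otherwise it stays infected.
                if rng.random() >= gamma:
                    next_infected.add(node)
            else:
                # Each infected neighbour transmits independently.
                k = sum(1 for nb in adjacency[node] if nb in infected)
                if k and rng.random() < 1 - (1 - beta) ** k:
                    next_infected.add(node)
        infected = next_infected
        for node in infected:
            steps_infected[node] += 1
    return steps_infected


# Toy path graph a - b - c (hypothetical node names).
graph = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}
counts = simulate_sis(graph, {"a"}, beta=0.5, gamma=0.2, steps=50,
                      rng=random.Random(0))
```

Nodes that spend many steps infected, or that infect many neighbours when used as seeds, are the kind of node such a model would flag as spreaders. How the paper turns SIS output into level-1 and level-2 spreader sets is not stated in the abstract.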

3.

RFCM-PALM: In-Silico Prediction of S-Palmitoylation Sites in the Synaptic Proteins for Male/Female Mouse Data.

Bandyopadhyay, Soumyendu Sekhar; Halder, Anup Kumar; Zareba-Koziol, Monika; Bartkowiak-Kaczmarek, Anna; Dutta, Aviinandaan; Chatterjee, Piyali; Nasipuri, Mita; Wójtowicz, Tomasz; Wlodarczyk, Jakub; Basu, Subhadip.

Int J Mol Sci ; 22(18)2021 Sep 14.

Article in English | MEDLINE | ID: mdl-34576064

ABSTRACT

S-palmitoylation is a reversible covalent post-translational modification of the cysteine thiol side chain by palmitic acid. S-palmitoylation plays a critical role in a variety of biological processes and is implicated in several human diseases. Therefore, identifying specific sites of this modification is crucial for understanding their functional consequences in physiology and pathology. We present a random forest (RF) classifier-based consensus strategy (RFCM-PALM) for predicting the palmitoylated cysteine sites on synaptic proteins from male/female mouse data. To design the prediction model, we have introduced a heuristic strategy for selecting the optimum set of physicochemical features from the AAIndex dataset using (a) K-Best (KB) features, (b) a genetic algorithm (GA), and (c) a union (UN) of KB- and GA-based features. Furthermore, decisions from the best-trained models of the KB, GA, and UN-based classifiers are combined through a three-star quality consensus strategy to further refine and enhance the scores of the individual models. The experiment is carried out on three categorized synaptic protein datasets: male mouse, female mouse, and combined (male + female). In each group, weighted data is used for training, and knock-out data is used as the hold-out set for performance evaluation and comparison. RFCM-PALM shows ~80% area under curve (AUC) score in all three categories of datasets and achieves 10% average accuracy (male-15%, female-15%, and combined-7%) improvements on the hold-out set compared to the state-of-the-art approaches. To summarize, our method, with efficient feature selection and a novel consensus strategy, shows significant performance gains in the prediction of S-palmitoylation sites in mouse datasets.

Subjects

Algorithms , Computer Simulation , Lipoylation , Nerve Tissue Proteins/metabolism , Synapses/metabolism , Animals , Protein Databases , Female , Male , Mice
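
The abstract above combines decisions from three classifiers (KB, GA, UN) in a "three-star quality consensus". The record does not define the rule, so the sketch below shows only one plausible reading, in which a site's star rating is the number of models that predict it as palmitoylated. The helper names and site identifiers are hypothetical.

```python
def star_rating(kb_pred, ga_pred, un_pred):
    """Count how many of the three models vote 'palmitoylated' (0 to 3).

    Assumption: one star per agreeing model; the paper may weight
    or combine the votes differently.
    """
    return int(bool(kb_pred)) + int(bool(ga_pred)) + int(bool(un_pred))


def consensus_sites(predictions, min_stars):
    """Return site ids whose star rating reaches min_stars.

    predictions: dict mapping a site id to a (kb, ga, un) tuple of
    boolean model decisions.
    """
    return sorted(
        site for site, votes in predictions.items()
        if star_rating(*votes) >= min_stars
    )


# Hypothetical cysteine sites and model decisions.
example = {
    "P1_C12": (True, True, True),
    "P1_C40": (True, False, True),
    "P2_C7": (False, False, True),
}
high_confidence = consensus_sites(example, min_stars=3)
```

Raising `min_stars` trades recall for precision, which is the usual motivation for reporting consensus calls at several quality levels.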

4.

Passive Auto Focusing of Pathological Microscope with Intelligent Field Image Collection Mechanism.

Ghosh, Pramit; Bhattacharjee, Debotosh; Nasipuri, Mita.

J Med Syst ; 45(2): 25, 2021 Jan 16.

Article in English | MEDLINE | ID: mdl-33452582

ABSTRACT

The microscope is one of the most widely used pieces of pathological equipment for analyzing body fluids such as blood and sputum at a granular level. To reduce the workload on pathologists and strengthen telehealth services, an automatic self-focusing microscope with a mechanism for collecting images from different fields is required. This work discusses the conversion of a compound microscope into a fully digital, self-focusing automatic microscope with an intelligent field image collection mechanism. The method uses a passive autofocusing technique: the most informative regions are identified on the basis of texture information, and features from these regions are used to autofocus the microscope. The system can collect multiple snapshots from different regions of a smear sample slide. The problem with a smear slide is that the smear has non-uniform thickness on the glass, so some regions have a very thick layer and others a very thin one. In general, neither kind of region is considered for pathological analysis. The proposed system is capable of detecting the regions of the smear slide that are suitable for collecting snapshot images. A soft computing approach is used to detect the desired regions of the sample on the slide. A Raspberry Pi is used for the control section, and multi-threaded parallel programming is used to optimize I/O execution and waiting time. The performance of the proposed system is satisfactory: the average peak signal-to-noise ratio (PSNR) is about 33 in comparison with manual focusing by a domain expert. The computation time of the system, measured on the benchmark microscopic image dataset, is better than that of other learning-based methods. Autofocusing of a pathological microscope with an intelligent field image collection mechanism is highly useful in the remote healthcare domain. This work describes a mechanism for converting a conventional compound microscope into a telehealth-compatible (IoT-enabled) microscope. The system is highly suitable for developing countries, where an overall change of existing infrastructure is difficult for economic reasons.

Subjects

Computer-Assisted Image Processing , Microscopy , Humans , Software , Sputum
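
The abstract above reports an average peak signal-to-noise ratio (PSNR) of about 33 against manually focused images. As a reminder of what that figure measures, here is a minimal sketch of the standard PSNR formula, 10 * log10(MAX^2 / MSE), typically expressed in decibels. The function name, the nested-list image representation, and the 8-bit maximum of 255 are assumptions, and the authors' exact evaluation protocol is not given in the record.

```python
import math


def psnr(reference, candidate, max_value=255.0):
    """Peak signal-to-noise ratio between two equally sized grayscale images.

    Images are nested lists of pixel intensities (an assumed representation).
    Returns math.inf when the images are identical.
    """
    ref_pixels = [p for row in reference for p in row]
    cand_pixels = [p for row in candidate for p in row]
    if len(ref_pixels) != len(cand_pixels) or not ref_pixels:
        raise ValueError("images must be non-empty and the same size")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref_pixels, cand_pixels)) / len(ref_pixels)
    if mse == 0:
        return math.inf
    return 10 * math.log10(max_value ** 2 / mse)


# Hypothetical 2x2 images: expert-focused reference vs. autofocused capture.
manual = [[120, 130], [140, 150]]
auto = [[121, 129], [141, 149]]
score = psnr(manual, auto)
```

A higher PSNR means the autofocused image is closer, pixel by pixel, to the manually focused reference.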

5.

FunPred-1: protein function prediction from a protein interaction network using neighborhood analysis.

Saha, Sovan; Chatterjee, Piyali; Basu, Subhadip; Kundu, Mahantapas; Nasipuri, Mita.

Cell Mol Biol Lett ; 19(4): 675-91, 2014 Dec.

Article in English | MEDLINE | ID: mdl-25424913

ABSTRACT

Proteins are responsible for all biological activities in living organisms. Thanks to genome sequencing projects, large amounts of DNA and protein sequence data are now available, but the biological functions of many proteins are still not annotated in most cases. The unknown function of such non-annotated proteins may be inferred or deduced from their neighbors in a protein interaction network. In this paper, we propose two new methods to predict protein functions based on network neighborhood properties. FunPred 1.1 uses a combination of three simple-yet-effective scoring techniques: the neighborhood ratio, the protein path connectivity and the relative functional similarity. FunPred 1.2 applies a heuristic approach using the edge clustering coefficient to reduce the search space by identifying densely connected neighborhood regions. The overall accuracy achieved in FunPred 1.2 over 8 functional groups involving hetero-interactions in 650 yeast proteins is around 87%, which is higher than the accuracy with FunPred 1.1. It is also higher than the accuracy of many of the state-of-the-art protein function prediction methods described in the literature. The test datasets and the complete source code of the developed software are now freely available at http://code.google.com/p/cmaterbioinfo/ .

Subjects

Protein Interaction Mapping , Cluster Analysis , Protein Databases , Biological Models , Molecular Sequence Annotation , Protein Interaction Maps , Saccharomyces cerevisiae Proteins/physiology , Software
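
FunPred 1.2 in the abstract above uses the edge clustering coefficient to locate densely connected neighbourhoods. The record does not spell out the formula, so the sketch below uses one commonly cited definition: the number of triangles containing an edge, divided by the maximum number of triangles that edge could take part in. The function name and the toy network are assumptions, and the paper's variant may differ.

```python
def edge_clustering_coefficient(adjacency, u, v):
    """Edge clustering coefficient of edge (u, v) in an undirected graph.

    adjacency: dict mapping each node to the set of its neighbours.
    Uses ECC = z / min(deg(u) - 1, deg(v) - 1), where z is the number
    of common neighbours (triangles through the edge). Returns 0.0
    when the edge cannot belong to any triangle.
    """
    triangles = len(adjacency[u] & adjacency[v])
    possible = min(len(adjacency[u]) - 1, len(adjacency[v]) - 1)
    return triangles / possible if possible > 0 else 0.0


# Toy network (hypothetical protein names): a triangle plus one pendant node.
ppi = {
    "A": {"B", "C", "D"},
    "B": {"A", "C"},
    "C": {"A", "B"},
    "D": {"A"},
}
dense_edge = edge_clustering_coefficient(ppi, "A", "B")
sparse_edge = edge_clustering_coefficient(ppi, "A", "D")
```

Edges with a high coefficient sit inside tightly knit groups, so keeping only those edges is one way to shrink the search space before scoring candidate functions.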

6.

EPI-SF: essential protein identification in protein interaction networks using sequence features.

Saha, Sovan; Chatterjee, Piyali; Basu, Subhadip; Nasipuri, Mita.

PeerJ ; 12: e17010, 2024.

Article in English | MEDLINE | ID: mdl-38495766

ABSTRACT

Proteins are considered indispensable for an organism's viability, reproductive capability, and other fundamental physiological functions. Conventional biological assays for identifying essential proteins are slow, labor-intensive, and expensive. Therefore, it is widely accepted that computational methods are the fastest and most effective way to identify essential proteins. Although deep learning (DL) is a popular choice in machine learning (ML) applications, it is not suggested for this sequence-feature-based work because of the restricted availability of high-quality training sets of positive and negative samples. Some recent DL work does operate on limited data, however, and this is left as future work. Conventional ML techniques are thus used in this work due to their superior performance compared to DL methodologies. Accordingly, a technique called EPI-SF is proposed here, which employs ML to identify essential proteins within the protein-protein interaction network (PPIN). The protein sequence is the primary determinant of protein structure and function. So, initially, relevant protein sequence features are extracted from the proteins within the PPIN. These features are subsequently used as input for various machine learning models, including XGBoost Classifier, AdaBoost Classifier, logistic regression (LR), support vector classification (SVM), Decision Tree model (DT), Random Forest model (RF), and Naïve Bayes model (NB). The objective is to detect the essential proteins within the PPIN. The primary investigation, conducted on yeast, examined the performance of the various ML models on the yeast PPIN. Among these models, the RF model was the most effective, with precision, recall, F1-score, and AUC values of 0.703, 0.720, 0.711, and 0.745, respectively. It also performs better than other state-of-the-art methods based on traditional centrality measures, such as betweenness centrality (BC) and closeness centrality (CC), as well as deep learning methods such as DeepEP, as emphasized in the results section. Owing to its favorable performance, EPI-SF is then employed to predict novel essential proteins in the human PPIN. Because viruses tend to selectively target essential proteins involved in disease transmission within the human PPIN, the probable involvement of these proteins in COVID-19 and other related severe diseases is also investigated.

Subjects

Protein Interaction Maps , Saccharomyces cerevisiae , Humans , Bayes Theorem , Proteins/chemistry , Machine Learning

7.

Computational drug repurposing for viral infectious diseases: a case study on monkeypox.

Saha, Sovan; Chatterjee, Piyali; Nasipuri, Mita; Basu, Subhadip; Chakraborti, Tapabrata.

Brief Funct Genomics ; 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-38183212

ABSTRACT

The traditional method of drug reuse or repurposing has significantly contributed to the identification of new antiviral compounds and therapeutic targets, enabling rapid response to developing infectious illnesses. This article presents an overview of how modern computational methods are used in drug repurposing for the treatment of viral infectious diseases. These methods utilize data sets that include reviewed information on the host's response to pathogens and drugs, as well as various connections such as gene expression patterns and protein-protein interaction networks. We assess the potential benefits and limitations of these methods by examining monkeypox as a specific example, but the knowledge acquired can be applied to other comparable disease scenarios.

8.

GC-EnC: A Copula based ensemble of CNNs for malignancy identification in breast histopathology and cytology images.

Dey, Soumyajyoti; Mitra, Shyamali; Chakraborty, Sukanta; Mondal, Debashri; Nasipuri, Mita; Das, Nibaran.

Comput Biol Med ; 152: 106329, 2023 Jan.

Article in English | MEDLINE | ID: mdl-36473342

ABSTRACT

In the present work, we explore the potential of a Copula-based ensemble of CNNs (Convolutional Neural Networks) over individual classifiers for malignancy identification in histopathology and cytology images. A Copula-based model is proposed that integrates the three best-performing CNN architectures, namely DenseNet-161/201, ResNet-101/34, and InceptionNet-V3. The limitation of small datasets is circumvented using a fuzzy template-based data augmentation technique that intelligently selects multiple regions of interest (ROIs) from an image. The proposed data augmentation framework, combined with the ensemble technique, surpasses the performance of the individual CNNs in malignancy prediction on breast cytology and histopathology datasets. The proposed method achieves accuracies of 84.37%, 97.32%, and 91.67% on the JUCYT, BreakHis and BI datasets, respectively. This automated technique will serve as a useful guide to the pathologist in delivering the appropriate diagnostic decision with reduced time and effort. The relevant code of the proposed ensemble model is publicly available on GitHub.

Subjects

Breast Neoplasms , Humans , Female , Breast Neoplasms/diagnostic imaging , Breast Neoplasms/pathology , Computer Neural Networks , Breast/diagnostic imaging , Breast/pathology

9.

Correction: Deep-Fuzz: A synergistic integration of deep learning and fuzzy water flows for fine-grained nuclei segmentation in digital pathology.

Das, Nirmal; Saha, Satadal; Nasipuri, Mita; Basu, Subhadip; Chakraborti, Tapabrata.

PLoS One ; 18(11): e0295111, 2023.

Article in English | MEDLINE | ID: mdl-38011184

ABSTRACT

[This corrects the article DOI: 10.1371/journal.pone.0286862.].

10.

Deep-Fuzz: A synergistic integration of deep learning and fuzzy water flows for fine-grained nuclei segmentation in digital pathology.

Das, Nirmal; Saha, Satadal; Nasipuri, Mita; Basu, Subhadip; Chakraborti, Tapabrata.

PLoS One ; 18(6): e0286862, 2023.

Article in English | MEDLINE | ID: mdl-37352172

ABSTRACT

Robust semantic segmentation of tumour micro-environment is one of the major open challenges in machine learning enabled computational pathology. Though deep learning based systems have made significant progress, their task agnostic data driven approach often lacks the contextual grounding necessary in biomedical applications. We present a novel fuzzy water flow scheme that takes the coarse segmentation output of a base deep learning framework to then provide a more fine-grained and instance level robust segmentation output. Our two stage synergistic segmentation method, Deep-Fuzz, works especially well for overlapping objects, and achieves state-of-the-art performance in four public cell nuclei segmentation datasets. We also show through visual examples how our final output is better aligned with pathological insights, and thus more clinically interpretable.

Subjects

Deep Learning , Cell Nucleus , Machine Learning , Water , Computer-Assisted Image Processing

11.

Assessment of GO-Based Protein Interaction Affinities in the Large-Scale Human-Coronavirus Family Interactome.

Bandyopadhyay, Soumyendu Sekhar; Halder, Anup Kumar; Saha, Sovan; Chatterjee, Piyali; Nasipuri, Mita; Basu, Subhadip.

Vaccines (Basel) ; 11(3)2023 Feb 25.

Article in English | MEDLINE | ID: mdl-36992133

ABSTRACT

SARS-CoV-2 is a novel coronavirus that replicates by interacting with host proteins. As a result, identifying virus-host protein-protein interactions could help researchers better understand the disease transmission behavior of the virus and identify possible COVID-19 drugs. The International Committee on Taxonomy of Viruses has determined that nCoV is 89% genetically similar to SARS-CoV, which caused the epidemic in 2003. This paper focuses on assessing the host-pathogen protein interaction affinity of the coronavirus family, which has 44 different variants. In light of these considerations, a GO-semantic scoring function based on Gene Ontology (GO) graphs is provided for determining the binding affinity of any two proteins at the organism level. Based on the availability of GO annotations for the proteins, 11 of the 44 viral variants are considered, viz., SARS-CoV-2, SARS, MERS, Bat coronavirus HKU3, Bat coronavirus Rp3/2004, Bat coronavirus HKU5, Murine coronavirus, Bovine coronavirus, Rat coronavirus, Bat coronavirus HKU4, and Bat coronavirus 133/2005. The fuzzy scoring function of the entire host-pathogen network has been processed with ~180 million potential interactions generated from 19,281 host proteins and around 242 viral proteins. ~4.5 million potential level-one host-pathogen interactions are computed based on the estimated interaction affinity threshold. The resulting host-pathogen interactome is also validated against state-of-the-art experimental networks. The study has also been extended toward drug repurposing by analyzing the FDA-listed COVID drugs.

12.

PFP-GO: Integrating protein sequence, domain and protein-protein interaction information for protein function prediction using ranked GO terms.

Sengupta, Kaustav; Saha, Sovan; Halder, Anup Kumar; Chatterjee, Piyali; Nasipuri, Mita; Basu, Subhadip; Plewczynski, Dariusz.

Front Genet ; 13: 969915, 2022.

Article in English | MEDLINE | ID: mdl-36246645

ABSTRACT

Protein function prediction is gradually emerging as an essential field in biological and computational studies. Though the latter has clinched a significant footprint, it has been observed that the application of computational information gathered from multiple sources has more significant influence than the one derived from a single source. Considering this fact, a methodology, PFP-GO, is proposed where heterogeneous sources like Protein Sequence, Protein Domain, and Protein-Protein Interaction Network have been processed separately for ranking each individual functional GO term. Based on this ranking, GO terms are propagated to the target proteins. While Protein sequence enriches the sequence-based information, Protein Domain and Protein-Protein Interaction Networks embed structural/functional and topological based information, respectively, during the phase of GO ranking. Performance analysis of PFP-GO is also based on Precision, Recall, and F-Score. The same was found to perform reasonably better when compared to the other existing state-of-art. PFP-GO has achieved an overall Precision, Recall, and F-Score of 0.67, 0.58, and 0.62, respectively. Furthermore, we check some of the top-ranked GO terms predicted by PFP-GO through multilayer network propagation that affect the 3D structure of the genome. The complete source code of PFP-GO is freely available at https://sites.google.com/view/pfp-go/.
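
The abstract above reports Precision, Recall, and F-Score values of 0.67, 0.58, and 0.62. Assuming the F-Score is the usual harmonic mean of precision and recall (F1), which the record does not state explicitly, the three figures can be checked against each other with a short sketch. The function name is illustrative.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the PFP-GO abstract.
reported_precision = 0.67
reported_recall = 0.58
recomputed = round(f_score(reported_precision, reported_recall), 2)  # 0.62
```

The recomputed value agrees with the reported F-Score, so the three numbers are mutually consistent under the F1 assumption.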

13.

JUPPI: A Multi-Level Feature Based Method for PPI Prediction and a Refined Strategy for Performance Assessment.

Halder, Anup Kumar; Bandyopadhyay, Soumyendu Sekhar; Chatterjee, Piyali; Nasipuri, Mita; Plewczynski, Dariusz; Basu, Subhadip.

IEEE/ACM Trans Comput Biol Bioinform ; 19(1): 531-542, 2022.

Article in English | MEDLINE | ID: mdl-32750875

ABSTRACT

Over the years, several methods have been proposed for computational PPI prediction, with different performance evaluation strategies. When attempting to benchmark performance scores, most of these methods suffer from poorly designed cross-validation strategies, ad hoc selection of positive/negative samples, etc. To address these issues, a refined evaluation strategy is introduced in our proposed multi-level feature-based PPI prediction approach (JUPPI), which uses sequence, domain and GO information as features. During the evaluation process, we first extract high-quality negative data using three-stage filtering, and then introduce a pair-input based cross-validation strategy with three difficulty levels for test-set predictions. Our proposed evaluation strategy reduces the component-level overlapping issue in test sets. The performance of JUPPI is compared with that of the state-of-the-art approaches in this domain and tested on six independent PPI datasets. In almost all the datasets, JUPPI outperforms the state-of-the-art not only at the human proteome level for PPI prediction, but also for prediction of interactors for intrinsically disordered human proteins. The JUPPI tool and the developed datasets (JUPPId) are available in the public domain for academic use at https://figshare.com/projects/JUPPI_A_Multi-level_Feature_Based_Method_for_PPI_Prediction_and_a_Refined_Strategy_for_Performance_Assessment/81656, along with supplementary materials, which can be found on the Computer Society Digital Library at http://doi.ieeecomputersociety.org/10.1109/TCBB.2020.3004970.

Subjects

Computational Biology , Proteins , Humans
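
The JUPPI abstract above mentions a pair-input cross-validation strategy with three difficulty levels that reduces component-level overlap between training and test sets. The record does not define the levels. A common way to grade pair-input test cases, used here purely as an assumption, is by how many of the two proteins in a test pair also appear somewhere in the training pairs. The function name and protein identifiers are hypothetical.

```python
def difficulty_level(test_pair, training_pairs):
    """Grade a test pair by its component overlap with the training set.

    Returns "both seen", "one seen", or "none seen" (hardest), depending
    on how many proteins of the test pair occur in any training pair.
    This grading scheme is an illustrative assumption, not taken from
    the JUPPI paper.
    """
    training_proteins = {protein for pair in training_pairs for protein in pair}
    seen = sum(1 for protein in test_pair if protein in training_proteins)
    return {2: "both seen", 1: "one seen", 0: "none seen"}[seen]


# Hypothetical protein identifiers.
train = [("P1", "P2"), ("P2", "P3")]
levels = {
    pair: difficulty_level(pair, train)
    for pair in [("P1", "P3"), ("P1", "P9"), ("P8", "P9")]
}
```

Reporting scores separately per level shows how much of a predictor's accuracy comes from having already seen the individual proteins rather than from learning what makes a pair interact.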

14.

ML-DTD: Machine Learning-Based Drug Target Discovery for the Potential Treatment of COVID-19.

Saha, Sovan; Chatterjee, Piyali; Halder, Anup Kumar; Nasipuri, Mita; Basu, Subhadip; Plewczynski, Dariusz.

Vaccines (Basel) ; 10(10)2022 Sep 30.

Article in English | MEDLINE | ID: mdl-36298508

ABSTRACT

Recent research has highlighted that a large section of druggable protein targets in the human interactome remains unexplored for various diseases. Exploring them could support drug repurposing studies and help in the in-silico prediction of new drug-human protein target interactions. The same applies to the current COVID-19 pandemic. It is highly desirable to identify potential human drug targets for COVID-19 using a machine learning approach, since it saves time and labor compared to traditional experimental methods. Structure-based drug discovery, where druggability is determined by molecular docking, is only appropriate for proteins whose three-dimensional structures are available. Machine learning algorithms that learn features differentiating targets from non-targets can be applied to proteins whose 3-D structures are unavailable. In this research, a Machine Learning-based Drug Target Discovery (ML-DTD) approach is proposed, in which a machine learning model is first built and tested on a curated dataset of COVID-19 human drug targets and non-targets, formed using the Therapeutic Target Database (TTD) and the human interactome, with several classifiers: XGBoost Classifier, AdaBoost Classifier, Logistic Regression, Support Vector Classification, Decision Tree Classifier, Random Forest Classifier, Naive Bayes Classifier, and K-Nearest Neighbour Classifier (KNN). In this method, protein features include Gene Set Enrichment Analysis (GSEA) ranking, properties derived from the protein sequence, and encoded protein network centrality-based measures. Among all these, the XGBoost, KNN, and Random Forest models are satisfactory and consistent. This model is further used to predict novel COVID-19 human drug targets, which are then validated by target pathway analysis, the emergence of allied repurposed drugs, and their subsequent docking study.

15.

Rule-Based Pruning and In Silico Identification of Essential Proteins in Yeast PPIN.

Banik, Anik; Podder, Souvik; Saha, Sovan; Chatterjee, Piyali; Halder, Anup Kumar; Nasipuri, Mita; Basu, Subhadip; Plewczynski, Dariusz.

Cells ; 11(17)2022 Aug 25.

Article in English | MEDLINE | ID: mdl-36078056

ABSTRACT

Proteins are vital for the significant cellular activities of living organisms. However, not all of them are essential. Identifying essential proteins through different biological experiments is relatively more laborious and time-consuming than the computational approaches used in recent times. However, practical implementation of conventional scientific methods sometimes becomes challenging due to poor performance impact in specific scenarios. Thus, more developed and efficient computational prediction models are required for essential protein identification. An effective methodology is proposed in this research, capable of predicting essential proteins in a refined yeast protein-protein interaction network (PPIN). The rule-based refinement is done using protein complex and local interaction density information derived from the neighborhood properties of proteins in the network. Identification and pruning of non-essential proteins are equally crucial here. In the initial phase, careful assessment is performed by applying node and edge weights to identify and discard the non-essential proteins from the interaction network. Three cut-off levels are considered for each node and edge weight for pruning the non-essential proteins. Once the PPIN has been filtered out, the second phase starts with two centralities-based approaches: (1) local interaction density (LID) and (2) local interaction density with protein complex (LIDC), which are successively implemented to identify the essential proteins in the yeast PPIN. Our proposed methodology achieves better performance in comparison to the existing state-of-the-art techniques.

Subjects

Protein Interaction Maps , Saccharomyces cerevisiae , Proteins/metabolism , Saccharomyces cerevisiae/metabolism

16.

PPI_SVM: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables.

Chatterjee, Piyali; Basu, Subhadip; Kundu, Mahantapas; Nasipuri, Mita; Plewczynski, Dariusz.

Cell Mol Biol Lett ; 16(2): 264-78, 2011 Jun.

Article in English | MEDLINE | ID: mdl-21442443

ABSTRACT

Protein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting partners is not known. Recently, a novel prediction method was proposed by exploiting physical interactions of constituent domains. We propose here a novel knowledge-based prediction method, namely PPI_SVM, which predicts interactions between two protein sequences by exploiting their domain information. We trained a two-class support vector machine on the benchmarking set of pairs of interacting proteins extracted from the Database of Interacting Proteins (DIP). The method considers all possible combinations of constituent domains between two protein sequences, unlike most of the existing approaches. Moreover, it deals with both single-domain proteins and multi domain proteins; therefore it can be applied to the whole proteome in high-throughput studies. Our machine learning classifier, following a brainstorming approach, achieves accuracy of 86%, with specificity of 95%, and sensitivity of 75%, which are better results than most previous methods that sacrifice recall values in order to boost the overall precision. Our method has on average better sensitivity combined with good selectivity on the benchmarking dataset. The PPI_SVM source code, train/test datasets and supplementary files are available freely in the public domain at: http://code.google.com/p/cmater-bioinfo/.

Subjects

Artificial Intelligence , Protein Interaction Domains and Motifs , Protein Interaction Mapping/methods , Algorithms , Protein Databases
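
The PPI_SVM abstract above states that the method considers all possible combinations of constituent domains between two protein sequences, covering both single-domain and multi-domain proteins. A minimal sketch of that enumeration step is shown below. The function name, the domain labels, and the choice to treat domain pairs as unordered are assumptions, and how the paper encodes these pairs for the support vector machine is not given in the record.

```python
from itertools import product


def domain_pairs(domains_a, domains_b):
    """Enumerate every domain-domain combination between two proteins.

    domains_a, domains_b: lists of domain identifiers for each protein.
    Each pair is stored in sorted order so that (x, y) and (y, x) count
    once (an assumption made for this illustration).
    """
    return sorted({tuple(sorted(pair)) for pair in product(domains_a, domains_b)})


# Hypothetical domain identifiers for a multi-domain and a single-domain protein.
protein_a = ["D1", "D2"]
protein_b = ["D3"]
pairs = domain_pairs(protein_a, protein_b)  # [("D1", "D3"), ("D2", "D3")]
```

Because the enumeration works for any list length, single-domain and multi-domain proteins are handled by the same code path, which matches the whole-proteome applicability claimed in the abstract.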

17.

LINPE-BL: A Local Descriptor and Broad Learning for Identification of Abnormal Breast Thermograms.

Pramanik, Sourav; Bhattacharjee, Debotosh; Nasipuri, Mita; Krejcar, Ondrej.

IEEE Trans Med Imaging ; 40(12): 3919-3931, 2021 Dec.

Article in English | MEDLINE | ID: mdl-34329158

ABSTRACT

This paper proposes a novel local feature descriptor coined as a local instant-and-center-symmetric neighbor-based pattern of the extrema-images (LINPE) to detect breast abnormalities in thermal breast images. It is a hybrid descriptor that combines two different feature descriptors: one is the inverse-probability difference extrema (IpDE), and another is the local instant and center-symmetric neighbor-based pattern (LICsNP). IpDE is developed to compute the intensity-inhomogeneity-invariant feature-based image of the breast thermogram. Besides, the LICsNP is intended to capture the local microstructure pattern information in the IpDE image. A new paradigm, named Broad Learning (BL) network, is introduced here as a classifier to differentiate the healthy and sick breast thermograms efficiently. The efficacy of the proposed system is quantitatively validated on the images of DMR-IR and DBT-TU-JU databases. Extensive experimentation on these databases with an average accuracy of 96.90% and 94%, respectively, justifies proposed system's superiority in the differentiation of healthy and sick breast thermograms over the other related existing state-of-the-art methods. The proposed system also performs consistently in the presence of noise and rotational changes.

Subjects

Breast; Thermography; Breast/diagnostic imaging; Databases, Factual

18.

Detection of spreader nodes in human-SARS-CoV protein-protein interaction network.

Saha, Sovan; Chatterjee, Piyali; Nasipuri, Mita; Basu, Subhadip.

PeerJ ; 9: e12117, 2021.

Article in English | MEDLINE | ID: mdl-34567845

ABSTRACT

The entire world is witnessing the coronavirus pandemic (COVID-19), caused by a novel coronavirus (n-CoV) generally identified as Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). SARS-CoV-2 promotes fatal chronic respiratory disease followed by multiple organ failure, ultimately putting an end to human life. The International Committee on Taxonomy of Viruses (ICTV) has reached a consensus that SARS-CoV-2 is highly genetically similar (up to 89%) to the Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV), which had an outbreak in 2003. With this hypothesis, the current work focuses on identifying the spreader nodes in the SARS-CoV-human protein-protein interaction network (PPIN) to find a possible lineage with the disease propagation pattern of the current pandemic. Various PPIN characteristics, such as edge ratio, neighborhood density, and node weight, have been explored to define a new feature, the spreadability index, by which spreader proteins and protein-protein interactions (in the form of network edges) are identified. Top spreader nodes with a high spreadability index have been validated by the Susceptible-Infected-Susceptible (SIS) disease model, first using a synthetic PPIN and then a SARS-CoV-human PPIN. The ranked edges highlight the path of entire disease propagation from SARS-CoV to the human PPIN (up to the level-2 neighborhood). The developed network attribute, the spreadability index, and the generated SIS model perform better than existing state-of-the-art methodologies based on network centrality.
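
The Susceptible-Infected-Susceptible validation mentioned above can be pictured with a small discrete-time simulation on a graph. The following is a minimal sketch, assuming a synchronous update, a per-contact infection probability `beta` and a recovery probability `gamma`; the function name, the parameter names and the toy three-node network are hypothetical and are not taken from the paper.

```python
import random


def simulate_sis(adjacency, seeds, beta, gamma, steps, rng):
    """Run a discrete-time SIS process and return the infected count per step.

    adjacency: dict mapping each node to an iterable of its neighbors.
    seeds: nodes infected at time 0 (for example, candidate spreader nodes).
    """
    infected = set(seeds)
    history = [len(infected)]
    for _ in range(steps):
        next_infected = set()
        for node, neighbors in adjacency.items():
            if node in infected:
                # An infected node recovers (becomes susceptible) with prob. gamma.
                if rng.random() >= gamma:
                    next_infected.add(node)
            else:
                k = sum(1 for nb in neighbors if nb in infected)
                # Probability that at least one of k infected neighbors transmits.
                if k and rng.random() < 1 - (1 - beta) ** k:
                    next_infected.add(node)
        infected = next_infected
        history.append(len(infected))
    return history


# Toy path network a - b - c, seeded at one end.
toy_network = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
spread = simulate_sis(toy_network, {"a"}, beta=1.0, gamma=0.0, steps=3,
                      rng=random.Random(0))
```

Seeding the process with different candidate nodes and comparing the resulting infection curves is one plausible way to compare node rankings; how the authors scored the comparison is not described in the abstract.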

19.

Protein function prediction from dynamic protein interaction network using gene expression data.

Saha, Sovan; Prasad, Abhimanyu; Chatterjee, Piyali; Basu, Subhadip; Nasipuri, Mita.

J Bioinform Comput Biol ; 17(4): 1950025, 2019 08.

Article in English | MEDLINE | ID: mdl-31617461

ABSTRACT

Computational prediction of functional annotation of proteins is an uphill task. There is an ever-increasing gap between the functional characterization of protein sequences and the deluge of protein sequences generated by large-scale sequencing projects. Protein interactions are frequently observed to be dynamic, mostly influenced by a change of state or a change in stimuli. Functional characterization of proteins can be inferred from these dynamic interactions. In this work, we have used a dynamic protein-protein interaction network (PPIN), time course gene expression data and protein sequence information to predict the functional annotation of proteins. It has also been observed that, during the progression of a particular function, not all proteins are active at all time points. For unannotated active proteins, our proposed methodology explores the dynamic PPIN consisting of level-1 and level-2 neighboring proteins at different time points, filtered by the Damerau-Levenshtein edit distance, which estimates the similarity between two protein sequences, and by coefficient of variation methods, which assess the strength of an edge in the network. Finally, from the filtered dynamic PPIN, at each time point, functional annotations of the level-2 proteins are assigned to the unknown and unannotated active proteins through the level-1 neighbor, following a bottom-up strategy. Our proposed methodology achieves an average precision, recall and F-score of 0.59, 0.76 and 0.61 respectively, which is significantly higher than the reported state-of-the-art methods.
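
For readers unfamiliar with the Damerau-Levenshtein edit distance used in the filtering step, the sketch below implements its common restricted form (optimal string alignment), which counts insertions, deletions, substitutions and transpositions of adjacent characters. This is a generic textbook version under stated assumptions: the abstract does not say which variant the authors used or how the distance was thresholded, and the example sequences are hypothetical.

```python
def osa_distance(a, b):
    """Restricted Damerau-Levenshtein (optimal string alignment) distance."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Transposition of two adjacent characters.
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


# Hypothetical short amino-acid strings differing by one adjacent swap.
swap_distance = osa_distance("MKVL", "MVKL")
```

A smaller distance indicates more similar sequences, so a filter of this kind would keep neighbor pairs whose distance falls below some chosen cutoff.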

Subjects

Gene Expression; Protein Interaction Mapping/methods; Proteins/metabolism; Computational Biology/methods; Databases, Genetic; Fungal Proteins/genetics; Fungal Proteins/metabolism; Proteins/genetics; Yeasts/genetics

20.

Patch-based system for Classification of Breast Histology images using deep learning.

Roy, Kaushiki; Banik, Debapriya; Bhattacharjee, Debotosh; Nasipuri, Mita.

Comput Med Imaging Graph ; 71: 90-103, 2019 01.

Article in English | MEDLINE | ID: mdl-30594745

ABSTRACT

In this work, we propose a patch-based classifier (PBC) using a convolutional neural network (CNN) for automatic classification of histopathological breast images. The limited number of images necessitated patch extraction and augmentation to boost the number of training samples. Thus, patches of suitable sizes carrying crucial diagnostic information were extracted from the original images. The proposed classification system works in two different modes: one patch in one decision (OPOD) and all patches in one decision (APOD). In OPOD mode, the proposed PBC first predicts the class label of each patch. If that class label is the same for all the extracted patches and matches the class label of the image, the output is considered a correct classification. In the other mode, APOD, the class label of each extracted patch is predicted as in OPOD, and a majority voting scheme takes the final decision about the class label of the image. We have used the ICIAR 2018 breast histology image dataset for this work, which comprises 4 different classes, namely normal, benign, in situ and invasive carcinoma. Experimental results show that our proposed OPOD mode achieved a patch-wise classification accuracy of 77.4% for 4 and 84.7% for 2 histopathological classes respectively on the test set obtained by splitting the training dataset. Also, our proposed APOD technique achieved an image-wise classification accuracy of 90% for 4-class and 92.5% for 2-class classification respectively on the split test set. Further, we have achieved an accuracy of 87% on the hidden test dataset of ICIAR-2018.
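
The two decision modes described above can be sketched in a few lines once per-patch labels are available. This is a minimal sketch, assuming the patch labels have already been predicted by some classifier; the function names are hypothetical, and the tie-breaking behavior (the first label to reach the top count wins) is an assumption, since the abstract does not say how ties are resolved.

```python
from collections import Counter


def opod_is_correct(patch_labels, true_label):
    """OPOD: correct only if every patch carries the image's true label."""
    return all(label == true_label for label in patch_labels)


def apod_label(patch_labels):
    """APOD: image-level label decided by majority vote over patch labels.

    Assumption: ties go to the label encountered first among the most common.
    """
    return Counter(patch_labels).most_common(1)[0][0]


# Hypothetical predictions for the patches of one image.
patch_predictions = ["benign", "invasive", "benign", "benign"]
image_label = apod_label(patch_predictions)
strict_ok = opod_is_correct(patch_predictions, "benign")
```

The example shows why image-wise accuracy under voting can exceed patch-wise accuracy: a single misclassified patch fails the strict OPOD check but does not change the APOD vote.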

Subjects

Breast Neoplasms/diagnostic imaging; Breast Neoplasms/pathology; Deep Learning; Image Processing, Computer-Assisted/methods; Diagnosis, Differential; Female; Humans
