Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
Genomics ; 112(1): 174-183, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-30660789

RESUMO

Protein complexes are one of the most important functional units for deriving biological processes within the cell. Experimental methods have provided valuable data to infer protein complexes. However, these methods have inherent limitations. Considering these limitations, many computational methods have been proposed to predict protein complexes, in the last decade. Almost all of these in-silico methods predict protein complexes from the ever-increasing protein-protein interaction (PPI) data. These computational approaches usually use the PPI data in the format of a huge protein-protein interaction network (PPIN) as input and output various sub-networks of the given PPIN as the predicted protein complexes. Some of these methods have already reached a promising efficiency in protein complex detection. Nonetheless, there are challenges in prediction of other types of protein complexes, specially sparse and small ones. New methods should further incorporate the knowledge of biological properties of proteins to improve the performance. Additionally, there are several challenges that should be considered more effectively in designing the new complex prediction algorithms in the future. This article not only reviews the history of computational protein complex prediction but also provides new insight for improvement of new methodologies. In this article, most important computational methods for protein complex prediction are evaluated and compared. In addition, some of the challenges in the reconstruction of the protein complexes are discussed. Finally, various tools for protein complex prediction and PPIN analysis as well as the current high-throughput databases are reviewed.


Assuntos
Complexos Multiproteicos/metabolismo , Mapeamento de Interação de Proteínas , Biologia Computacional/métodos , Software
2.
J Transl Med ; 17(1): 71, 2019 03 04.
Artigo em Inglês | MEDLINE | ID: mdl-30832671

RESUMO

BACKGROUND: Angiogenesis inhibition research is a cutting edge area in angiogenesis-dependent disease therapy, especially in cancer therapy. Recently, studies on anti-angiogenic peptides have provided promising results in the field of cancer treatment. METHODS: A non-redundant dataset of 135 anti-angiogenic peptides (positive instances) and 135 non anti-angiogenic peptides (negative instances) was used in this study. Also, 20% of each class were selected to construct an independent test dataset (see Additional files 1, 2). We proposed an effective machine learning based R package (AntAngioCOOL) to predict anti-angiogenic peptides. We have examined more than 200 different classifiers to build an efficient predictor. Also, more than 17,000 features were extracted to encode the peptides. RESULTS: Finally, more than 2000 informative features were selected to train the classifiers for detecting anti-angiogenic peptides. AntAngioCOOL includes three different models that can be selected by the user for different purposes; it is the most sensitive, most specific and most accurate. According to the obtained results AntAngioCOOL can effectively suggest anti-angiogenic peptides; this tool achieved sensitivity of 88%, specificity of 77% and accuracy of 75% on the independent test set. AntAngioCOOL can be accessed at https://cran.r-project.org/ . CONCLUSIONS: Only 2% of the extracted descriptors were used to build the predictor models. The results revealed that physico-chemical profile is the most important feature type in predicting anti-angiogenic peptides. Also, atomic profile and PseAAC are the other important features.


Assuntos
Inibidores da Angiogênese/análise , Inibidores da Angiogênese/farmacologia , Proteínas Angiogênicas/antagonistas & inibidores , Biologia Computacional , Software , Humanos , Aprendizado de Máquina
3.
J Theor Biol ; 304: 96-102, 2012 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-22504445

RESUMO

Gene expression is the main cause for the existence of various phenotypes. Through this procedure, the information stored in DNA rises to the phenotype. Essentially, gene expression is dependent upon the successful binding of transcription factors (TFs) - a specific type of proteins - to explicit positions in its upstream, TF binding sites (TFBSs). Unfortunately, finding these TFBSs is costly and laborious; therefore, discovering TFBSs computationally is a significant problem that many researches endeavor to solve. In this paper, a new TFBS discovery method is presented by considering known biological facts about TFBSs. The input to this method includes sequences with arbitrary lengths and the output comprises positions that tend to be TFBS. Through the application of previous methods along with a method that focuses on biological and simulated datasets, it is shown that this method achieves higher accuracy in discovering TFBSs.


Assuntos
Alinhamento de Sequência/métodos , Fatores de Transcrição/metabolismo , Algoritmos , Animais , Sítios de Ligação/genética , Biologia Computacional/métodos , Escherichia coli/genética , Regulação da Expressão Gênica , Humanos , Ligação Proteica/genética
4.
J Biomed Inform ; 43(5): 800-4, 2010 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-20546935

RESUMO

Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, algorithms for constructing full haplotype patterns from small available data through computational methods, Tag SNP selection problem, are convenient and attractive. This problem is proved to be an NP-hard problem, so heuristic methods may be useful. In this paper we present a heuristic method based on genetic algorithm to find reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, based on brute force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Software , Simulação por Computador , Doença/genética , Predisposição Genética para Doença/epidemiologia , Predisposição Genética para Doença/genética , Haplótipos , Humanos
5.
Med Chem ; 15(3): 216-230, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30484409

RESUMO

BACKGROUND: Noncoding RNAs (ncRNAs) which play an important role in various cellular processes are important in medicine as well as in drug design strategies. Different studies have shown that ncRNAs are dis-regulated in cancer cells and play an important role in human tumorigenesis. Therefore, it is important to identify and predict such molecules by experimental and computational methods, respectively. However, to avoid expensive experimental methods, computational algorithms have been developed for accurately and fast prediction of ncRNAs. OBJECTIVE: The aim of this review was to introduce the experimental and computational methods to identify and predict ncRNAs structure. Also, we explained the ncRNA's roles in cellular processes and drugs design, briefly. METHOD: In this survey, we will introduce ncRNAs and their roles in biological and medicinal processes. Then, some important laboratory techniques will be studied to identify ncRNAs. Finally, the state-of-the-art models and algorithms will be introduced along with important tools and databases. RESULTS: The results showed that the integration of experimental and computational approaches improves to identify ncRNAs. Moreover, the high accurate databases, algorithms and tools were compared to predict the ncRNAs. CONCLUSION: ncRNAs prediction is an exciting research field, but there are different difficulties. It requires accurate and reliable algorithms and tools. Also, it should be mentioned that computational costs of such algorithm including running time and usage memory are very important. Finally, some suggestions were presented to improve computational methods of ncRNAs gene and structural prediction.


Assuntos
RNA não Traduzido , Algoritmos , Simulação por Computador , Bases de Dados Factuais , Desenho de Fármacos , RNA não Traduzido/química , RNA não Traduzido/farmacologia , RNA não Traduzido/fisiologia
6.
Heliyon ; 4(7): e00705, 2018 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-30094375

RESUMO

Various cold-adapted organisms produce antifreeze proteins (AFPs), which prevent the freezing of cell fluids by inhibiting the growth of ice crystals. AFPs are currently being recognized in various organisms, living in extremely low temperatures. AFPs have several important applications in increasing freeze tolerance of plants, maintaining the tissue in frozen conditions and producing cold-hardy plants by applying transgenic technology. Substantial differences in the sequence and structure of the AFPs, pose a challenge for researchers to identify these proteins. In this paper, we proposed a novel method to identify AFPs, using supportive vector machine (SVM) by incorporating 4 types of features. Results of the two used benchmark datasets, revealed the strength of the proposed method in AFP prediction. According to the results of an independent test setup, our method outperformed the current state-of-the-art methods. In addition, the comparison results of the discrimination power of different feature types revealed that physicochemical descriptors are the most contributing features in AFP detection. This method has been implemented as a stand-alone tool, named afpCOOL, for various operating systems to predict AFPs with a user friendly graphical interface.

7.
Genes Genet Syst ; 88(5): 301-9, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24694393

RESUMO

Gene expression is a highly regulated biological process that is fundamental to the existence of phenotypes of any living organism. The regulatory relations are usually modeled as a network; simply, every gene is modeled as a node and relations are shown as edges between two related genes. This paper presents a novel method for inferring correlation networks, networks constructed by connecting co-expressed genes, through predicting co-expression level from genes promoter's sequences. According to the results, this method works well on biological data and its outcome is comparable to the methods that use microarray as input. The method is written in C++ language and is available upon request from the corresponding author.


Assuntos
Redes Reguladoras de Genes , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Software , Fatores de Transcrição/genética , Algoritmos , Sítios de Ligação , Expressão Gênica , Redes Neurais de Computação , Regiões Promotoras Genéticas , Ligação Proteica , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Fatores de Transcrição/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA