Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Integr Bioinform ; 20(3)2023 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-37978846

RESUMO

Hepatocellular carcinoma (HCC) has been associated with hepatitis C viral (HCV) infection as a potential risk factor. Nonetheless, the precise genetic regulatory mechanisms triggered by the virus, leading to virus-induced hepatocarcinogenesis, remain unclear. We hypothesized that HCV proteins might modulate the activity of aberrantly methylated HCC genes through regulatory pathways. Virus-host regulatory pathways, interactions between proteins, gene expression, transport, and stability regulation, were reconstructed using the ANDSystem. Gene expression regulation was statistically significant. Gene network analysis identified four out of 70 HCC marker genes whose expression regulation by viral proteins may be associated with HCC: DNA-binding protein inhibitor ID - 1 (ID1), flap endonuclease 1 (FEN1), cyclin-dependent kinase inhibitor 2A (CDKN2A), and telomerase reverse transcriptase (TERT). It suggested the following viral protein effects in HCV/human protein heterocomplexes: HCV NS3(p70) protein activates human STAT3 and NOTC1; NS2-3(p23), NS5B(p68), NS1(E2), and core(p21) activate SETD2; NS5A inhibits SMYD3; and NS3 inhibits CCN2. Interestingly, NS3 and E1(gp32) activate c-Jun when it positively regulates CDKN2A and inhibit it when it represses TERT. The discovered regulatory mechanisms might be key areas of focus for creating medications and preventative therapies to decrease the likelihood of HCC development during HCV infection.


Assuntos
Carcinoma Hepatocelular , Hepatite C , Neoplasias Hepáticas , Viroses , Humanos , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/metabolismo , Carcinoma Hepatocelular/patologia , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/metabolismo , Proteínas não Estruturais Virais/genética , Proteínas não Estruturais Virais/metabolismo , Hepacivirus/genética , Hepacivirus/metabolismo , Hepatite C/complicações , Hepatite C/genética , Histona-Lisina N-Metiltransferase
2.
J Integr Bioinform ; 20(3)2023 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-37978847

RESUMO

Bacillus strains are ubiquitous in the environment and are widely used in the microbiological industry as valuable enzyme sources, as well as in agriculture to stimulate plant growth. The Bacillus genus comprises several closely related groups of species. The rapid classification of these remains challenging using existing methods. Techniques based on MALDI-TOF MS data analysis hold significant promise for fast and precise microbial strains classification at both the genus and species levels. In previous work, we proposed a geometric approach to Bacillus strain classification based on mass spectra analysis via the centroid method (CM). One limitation of such methods is the noise in MS spectra. In this study, we used a denoising autoencoder (DAE) to improve bacteria classification accuracy under noisy MS spectra conditions. We employed a denoising autoencoder approach to convert noisy MS spectra into latent variables representing molecular patterns in the original MS data, and the Random Forest method to classify bacterial strains by latent variables. Comparison of the DAE-RF with the CM method using the artificially noisy test samples showed that DAE-RF offers higher noise robustness. Hence, the DAE-RF method could be utilized for noise-robust, fast, and neat classification of Bacillus species according to MALDI-TOF MS data.


Assuntos
Bacillus , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz/métodos , Bactérias
3.
Int J Mol Sci ; 23(23)2022 Nov 29.
Artigo em Inglês | MEDLINE | ID: mdl-36499269

RESUMO

The body of scientific literature continues to grow annually. Over 1.5 million abstracts of biomedical publications were added to the PubMed database in 2021. Therefore, developing cognitive systems that provide a specialized search for information in scientific publications based on subject area ontology and modern artificial intelligence methods is urgently needed. We previously developed a web-based information retrieval system, ANDDigest, designed to search and analyze information in the PubMed database using a customized domain ontology. This paper presents an improved ANDDigest version that uses fine-tuned PubMedBERT classifiers to enhance the quality of short name recognition for molecular-genetics entities in PubMed abstracts on eight biological object types: cell components, diseases, side effects, genes, proteins, pathways, drugs, and metabolites. This approach increased average short name recognition accuracy by 13%.


Assuntos
Inteligência Artificial , Mineração de Dados , Mineração de Dados/métodos , PubMed , Bases de Dados Factuais , Proteínas
4.
Life (Basel) ; 12(6)2022 Jun 02.
Artigo em Inglês | MEDLINE | ID: mdl-35743861

RESUMO

Coal worker's pneumoconiosis (CWP) is an occupationally induced progressive fibrotic lung disease. This irreversible but preventable disease currently affects millions across the world, mainly in countries with developed coal mining industries. Here, we report a pilot study that explores the sputum microbiome as a potential non-invasive bacterial biomarker of CWP status. Sputum samples were collected from 35 former and active coal miners diagnosed with CWP and 35 healthy controls. Sequencing of bacterial 16S rRNA genes was used to study the taxonomic composition of the respiratory microbiome. There was no difference in alpha diversity between CWP and controls. The structure of bacterial communities in sputum samples (ß diversity) differed significantly between cases and controls (pseudo-F = 3.61; p = 0.004). A significant increase in the abundance of Streptococcus (25.12 ± 11.37 vs. 16.85 ± 11.35%; p = 0.0003) was detected in samples from CWP subjects as compared to controls. The increased representation of Streptococcus in sputum from CWP patients was associated only with the presence of occupational pulmonary fibrosis, but did not depend on age, and did not differ between former and current miners. The study shows, for the first time, that the sputum microbiota of CWP subjects differs from that of controls. The results of our present exploratory study warrant further investigations on a larger cohort.

5.
Cancers (Basel) ; 13(2)2021 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-33440616

RESUMO

In previous studies, we described a method for detecting and typing malignant tumors of the thyroid gland in fine-needle aspiration biopsy samples via analysis of a molecular marker panel (normalized HMGA2 mRNA level; normalized microRNA-146b, -221, and -375 levels; mitochondrial-to-nuclear DNA ratio; and BRAFV600E mutation) in cytological preparations by quantitative PCR. In the present study, we aimed to estimate the specificity of the typing of different thyroid tumors by the proposed method. Fine-needle aspiration cytological preparations from 278 patients were used. The histological diagnosis was known for each sample. The positive and negative predictive values of the method assessed in this study were, respectively, 100% and 98% for papillary thyroid carcinoma (n = 63), 100% and 100% for medullary thyroid carcinoma (n = 19), 43.5% and 98% for follicular carcinoma (n = 15), and 86% and 100% for Hürthle cell carcinoma (n = 6). Thus, we demonstrate that the diagnostic panel, including the analysis of microRNA expression, mRNA expression, the BRAFV600E mutation, and the mitochondrial-to-nuclear DNA ratio, allows the highly accurate identification of papillary thyroid carcinoma, medullary thyroid carcinoma, and Hürthle cell carcinoma but not malignant follicular tumors (positive predictive value was below 50%).

6.
BMC Bioinformatics ; 21(Suppl 11): 228, 2020 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-32921303

RESUMO

BACKGROUND: The rapid growth of scientific literature has rendered the task of finding relevant information one of the critical problems in almost any research. Search engines, like Google Scholar, Web of Knowledge, PubMed, Scopus, and others, are highly effective in document search; however, they do not allow knowledge extraction. In contrast to the search engines, text-mining systems provide extraction of knowledge with representations in the form of semantic networks. Of particular interest are tools performing a full cycle of knowledge management and engineering, including automated retrieval, integration, and representation of knowledge in the form of semantic networks, their visualization, and analysis. STRING, Pathway Studio, MetaCore, and others are well-known examples of such products. Previously, we developed the Associative Network Discovery System (ANDSystem), which also implements such a cycle. However, the drawback of these systems is dependence on the employed ontologies describing the subject area, which limits their functionality in searching information based on user-specified queries. RESULTS: The ANDDigest system is a new web-based module of the ANDSystem tool, permitting searching within PubMed by using dictionaries from the ANDSystem tool and sets of user-defined keywords. ANDDigest allows performing the search based on complex queries simultaneously, taking into account many types of objects from the ANDSystem's ontology. The system has a user-friendly interface, providing sorting, visualization, and filtering of the found information, including mapping of mentioned objects in text, linking to external databases, sorting of data by publication date, citations number, journal H-indices, etc. The system provides data on trends for identified entities based on dynamics of interest according to the frequency of their mentions in PubMed by years. CONCLUSIONS: The main feature of ANDDigest is its functionality, serving as a specialized search for information about multiple associative relationships of objects from the ANDSystem's ontology vocabularies, taking into account user-specified keywords. The tool can be applied to the interpretation of experimental genetics data, the search for associations between molecular genetics objects, and the preparation of scientific and analytical reviews. It is presently available at https://anddigest.sysbio.ru/ .


Assuntos
Mineração de Dados/métodos , Internet , Software , Bases de Dados Factuais , PubMed
7.
J Clin Pathol ; 73(11): 722-727, 2020 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-32213552

RESUMO

AIMS: Analysis of molecular markers in addition to cytological analysis of fine-needle aspiration (FNA) samples is a promising way to improve the preoperative diagnosis of thyroid nodules. Previously, we have developed an algorithm for the differential diagnosis of thyroid nodules by means of a small set of molecular markers. Here, we aimed to validate this approach using FNA cytology samples of Bethesda categories III and IV, in which preoperative detection of malignancy by cytological analysis is impossible. METHODS: A total of 122 FNA smears from patients with indeterminate cytology (Bethesda III: 13 patients, Bethesda IV: 109 patients) were analysed by real-time PCR regarding the preselected set of molecular markers (the BRAF V600E mutation, normalised concentrations of HMGA2 mRNA, 3 microRNAs, and the mitochondrial/nuclear DNA ratio). The decision tree-based classifier was used to discriminate between benign and malignant tumours. RESULTS: The molecular testing detected malignancy in FNA smears of indeterminate cytology with 89.2% sensitivity, 84.6% positive predictive value, 92.9% specificity and 95.2% negative predictive value; these characteristics are comparable with those of more complicated commercial tests. Residual risk of malignancy for the thyroid nodules that were shown to be benign by this molecular method did not exceed the reported risk of malignancy for Bethesda II histological diagnosis. Analytical-accuracy assessment revealed required nucleic-acid input of ≥5 ng. CONCLUSIONS: The study shows feasibility of preoperative differential diagnosis of thyroid nodules of indeterminate cytology using a small panel of molecular markers of different types by a simple PCR-based method using stained FNA smears.


Assuntos
Algoritmos , Proteína HMGA2/genética , MicroRNAs/genética , Neoplasias da Glândula Tireoide/diagnóstico , Nódulo da Glândula Tireoide/diagnóstico , Adulto , Idoso , Biópsia por Agulha Fina , Citodiagnóstico , Diagnóstico Diferencial , Estudos de Viabilidade , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Neoplasias , Período Pré-Operatório , RNA Mensageiro/genética , Reação em Cadeia da Polimerase em Tempo Real , Sensibilidade e Especificidade , Glândula Tireoide/patologia , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia , Nódulo da Glândula Tireoide/genética , Nódulo da Glândula Tireoide/patologia
8.
BMC Cancer ; 19(1): 1010, 2019 Oct 28.
Artigo em Inglês | MEDLINE | ID: mdl-31660895

RESUMO

BACKGROUND: Analysis of molecular markers in addition to cytological analysis of fine-needle aspiration (FNA) samples is a promising way to improve the preoperative diagnosis of thyroid nodules. Nonetheless, in clinical practice, applications of existing diagnostic solutions based on the detection of somatic mutations or analysis of gene expression are limited by their high cost and difficulties with clinical interpretation. The aim of our work was to develop an algorithm for the differential diagnosis of thyroid nodules on the basis of a small set of molecular markers analyzed by real-time PCR. METHODS: A total of 494 preoperative FNA samples of thyroid goiters and tumors from 232 patients with known histological reports were analyzed: goiter, 105 samples (50 patients); follicular adenoma, 101 (48); follicular carcinoma, 43 (28); Hürthle cell carcinoma, 25 (11); papillary carcinoma, 121 (56); follicular variant of papillary carcinoma, 80 (32); and medullary carcinoma, 19 (12). Total nucleic acids extracted from dried FNA smears were analyzed for five somatic point mutations and two translocations typical of thyroid tumors as well as for relative concentrations of HMGA2 mRNA and 13 microRNAs and the ratio of mitochondrial to nuclear DNA by real-time PCR. A decision tree-based algorithm was built to discriminate benign and malignant tumors and to type the thyroid cancer. Leave-p-out cross-validation with five partitions was performed to estimate prediction quality. A comparison of two independent samples by quantitative traits was carried out via the Mann-Whitney U test. RESULTS: A minimum set of markers was selected (levels of HMGA2 mRNA and miR-375, - 221, and -146b in combination with the mitochondrial-to-nuclear DNA ratio) and yielded highly accurate discrimination (sensitivity = 0.97; positive predictive value = 0.98) between goiters with benign tumors and malignant tumors and accurate typing of papillary, medullary, and Hürthle cell carcinomas. The results support an alternative classification of follicular tumors, which differs from the histological one. CONCLUSIONS: The study shows the feasibility of the preoperative differential diagnosis of thyroid nodules using a panel of several molecular markers by a simple PCR-based method. Combining markers of different types increases the accuracy of classification.


Assuntos
DNA Mitocondrial/genética , Técnicas de Apoio para a Decisão , Bócio/diagnóstico , Proteína HMGA2/genética , MicroRNAs/genética , RNA Mensageiro/genética , Neoplasias da Glândula Tireoide/diagnóstico , Adulto , Idoso , Algoritmos , Biomarcadores Tumorais/genética , Biópsia por Agulha Fina , Confiabilidade dos Dados , Diagnóstico Diferencial , Estudos de Viabilidade , Feminino , Bócio/patologia , Humanos , Masculino , Pessoa de Meia-Idade , Período Pré-Operatório , Reação em Cadeia da Polimerase em Tempo Real , Neoplasias da Glândula Tireoide/patologia , Translocação Genética
9.
Methods Mol Biol ; 1934: 1-20, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31256369

RESUMO

The increase in the number of Web-based resources on posttranslational modification sites (PTMSs) in proteins is accelerating. This chapter presents a set of computational protocols describing how to work with the Internet resources when dealing with PTMSs. The protocols are intended for querying in PTMS-related databases, search of the PTMSs in the protein sequences and structures, and calculating the pI and molecular mass of the PTM isoforms. Thus, the modern bioinformatics prediction tools make it feasible to express protein modification in broader quantitative terms.


Assuntos
Biologia Computacional/métodos , Internet , Processamento de Proteína Pós-Traducional , Proteínas , Software , Sequência de Aminoácidos , Bases de Dados de Proteínas , Peso Molecular , Proteínas/química , Proteínas/metabolismo , Ferramenta de Busca , Interface Usuário-Computador , Navegador
10.
BMC Med Genomics ; 12(Suppl 2): 47, 2019 03 13.
Artigo em Inglês | MEDLINE | ID: mdl-30871556

RESUMO

BACKGROUND: Currently, more than 150 million people worldwide suffer from lymphedema. It is a chronic progressive disease characterized by high-protein edema of various parts of the body due to defects in lymphatic drainage. Molecular-genetic mechanisms of the disease are still poorly understood. Beginning of a clinical manifestation of primary lymphedema in middle age and the development of secondary lymphedema after treatment of breast cancer can be genetically determined. Disruption of endothelial cell apoptosis can be considered as one of the factors contributing to the development of lymphedema. However, a study of the relationship between genes associated with lymphedema and genes involved in endothelial apoptosis, in the associative gene network was not previously conducted. METHODS: In the current work, we used well-known methods (ToppGene and Endeavour), as well as methods previously developed by us, to prioritize genes involved in endothelial apoptosis and to find potential participants of molecular-genetic mechanisms of lymphedema among them. Original methods of prioritization took into account the overrepresented Gene Ontology biological processes, the centrality of vertices in the associative gene network, describing the interactions of endothelial apoptosis genes with genes associated with lymphedema, and the association of the analyzed genes with diseases that are comorbid to lymphedema. RESULTS: An assessment of the quality of prioritization was performed using criteria, which involved an analysis of the enrichment of the top-most priority genes by genes, which are known to have simultaneous interactions with lymphedema and endothelial cell apoptosis, as well as by genes differentially expressed in murine model of lymphedema. In particular, among genes involved in endothelial apoptosis, KDR, TNF, TEK, BMPR2, SERPINE1, IL10, CD40LG, CCL2, FASLG and ABL1 had the highest priority. The identified priority genes can be considered as candidates for genotyping in the studies involving the search for associations with lymphedema. CONCLUSIONS: Analysis of interactions of these genes in the associative gene network of lymphedema can improve understanding of mechanisms of interaction between endothelial apoptosis and lymphangiogenesis, and shed light on the role of disturbance of these processes in the development of edema, chronic inflammation and connective tissue transformation during the progression of the disease.


Assuntos
Apoptose , Redes Reguladoras de Genes , Linfedema/patologia , Software , Animais , Receptores de Proteínas Morfogenéticas Ósseas Tipo II/genética , Quimiocina CCL2/genética , Bases de Dados Genéticas , Modelos Animais de Doenças , Células Endoteliais/citologia , Células Endoteliais/metabolismo , Linfedema/genética , Camundongos , Receptor 2 de Fatores de Crescimento do Endotélio Vascular/genética
11.
BMC Bioinformatics ; 20(Suppl 1): 34, 2019 Feb 05.
Artigo em Inglês | MEDLINE | ID: mdl-30717676

RESUMO

BACKGROUND: Consideration of tissue-specific gene expression in reconstruction and analysis of molecular genetic networks is necessary for a proper description of the processes occurring in a specified tissue. Currently, there are a number of computer systems that allow the user to reconstruct molecular-genetic networks using the data automatically extracted from the texts of scientific publications. Examples of such systems are STRING, Pathway Commons, MetaCore and Ingenuity. The MetaCore and Ingenuity systems permit taking into account tissue-specific gene expression during the reconstruction of gene networks. Previously, we developed the ANDSystem tool, which also provides an automated extraction of knowledge from scientific texts and allows the reconstruction of gene networks. The main difference between our system and other tools is in the different types of interactions between objects, which makes the ANDSystem complementary to existing well-known systems. However, previous versions of the ANDSystem did not contain any information on tissue-specific expression. RESULTS: A new version of the ANDSystem has been developed. It offers the reconstruction of associative gene networks while taking into account the tissue-specific gene expression. The ANDSystem knowledge base features information on tissue-specific expression for 272 tissues. The system allows the reconstruction of combined gene networks, as well as performing the filtering of genes from such networks using the information on their tissue-specific expression. As an example of the application of such filtering, the gene network of the extrinsic apoptotic signaling pathway was analyzed. It was shown that considering different tissues can lead to changes in gene network structure, including changes in such indicators as betweenness centrality of vertices, clustering coefficient, network centralization, network density, etc. CONCLUSIONS: The consideration of tissue specificity can play an important role in the analysis of gene networks, in particular solving the problem of finding the most significant central genes. Thus, the new version of ANDSystem can be employed for a wide range of tasks related to biomedical studies of individual tissues. It is available at http://www-bionet.sscc.ru/and/cell /.


Assuntos
Mineração de Dados , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Especificidade de Órgãos/genética , Publicações , Apoptose/genética , Humanos , Semântica , Transdução de Sinais/genética
12.
BMC Genomics ; 19(Suppl 3): 76, 2018 02 09.
Artigo em Inglês | MEDLINE | ID: mdl-29504895

RESUMO

BACKGROUND: Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. RESULTS: We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. CONCLUSIONS: FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of experimental gene sets, both for different global networks and for different types of interactions. Using examples of thyroid cancer and apoptosis networks, we have shown that the links over-represented in the analyzed network in comparison with the random ones make possible a biological interpretation of the original gene/protein sets. The FunGeneNet web tool for assessment of the functional enrichment of networks is available at http://www-bionet.sscc.ru/fungenenet/ .


Assuntos
Redes Reguladoras de Genes , Genômica/métodos , Internet , Apoptose , Bases de Dados Genéticas , Ontologia Genética , Humanos , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia
13.
BMC Med Genomics ; 11(Suppl 1): 15, 2018 02 13.
Artigo em Inglês | MEDLINE | ID: mdl-29504915

RESUMO

BACKGROUND: Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. RESULTS: Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in biological processes related to the functioning of central nervous system. CONCLUSIONS: The application of methods of reconstruction and analysis of gene networks is a productive tool for studying the molecular mechanisms of comorbid conditions. The method put forth to rank genes by their importance to the comorbid condition of asthma and hypertension was employed that resulted in prediction of 10 genes, playing the key role in the development of the comorbid condition. The results can be utilised to plan experiments for identification of novel candidate genes along with searching for novel pharmacological targets.


Assuntos
Asma/genética , Biomarcadores/análise , Doenças do Sistema Nervoso Central/etiologia , Biologia Computacional/métodos , Mineração de Dados/métodos , Redes Reguladoras de Genes , Hipertensão/genética , Asma/epidemiologia , Catalase/genética , Comorbidade , Perfilação da Expressão Gênica , Humanos , Hipertensão/epidemiologia , Interleucina-10/genética , Software , Receptor 4 Toll-Like/genética
14.
J Integr Bioinform ; 15(4)2018 Dec 25.
Artigo em Inglês | MEDLINE | ID: mdl-30864351

RESUMO

Comorbid states of diseases significantly complicate diagnosis and treatment. Molecular mechanisms of comorbid states of asthma and hypertension are still poorly understood. Prioritization is a way for identifying genes involved in complex phenotypic traits. Existing methods of prioritization consider genetic, expression and evolutionary data, molecular-genetic networks and other. In the case of molecular-genetic networks, as a rule, protein-protein interactions and KEGG networks are used. ANDSystem allows reconstructing associative gene networks, which include more than 20 types of interactions, including protein-protein interactions, expression regulation, transport, catalysis, etc. In this work, a set of genes has been prioritized to find genes potentially involved in asthma and hypertension comorbidity. The prioritization was carried out using well-known methods (ToppGene and Endeavor) and a cross-talk centrality criterion, calculated by analysis of associative gene networks from ANDSystem. The identified genes, including IL1A, CD40LG, STAT3, IL15, FAS, APP, TLR2, C3, IL13 and CXCL10, may be involved in the molecular mechanisms of comorbid asthma/hypertension. An analysis of the dynamics of the frequency of mentioning the most priority genes in scientific publications revealed that the top 100 priority genes are significantly enriched with genes with increased positive dynamics, which may be a positive sign for further studies of these genes.


Assuntos
Asma/genética , Biomarcadores/análise , Biologia Computacional/métodos , Redes Reguladoras de Genes , Hipertensão/genética , Asma/epidemiologia , Comorbidade , Mineração de Dados , Alemanha/epidemiologia , Humanos , Hipertensão/epidemiologia , Software
15.
J Bioinform Comput Biol ; 15(2): 1650044, 2017 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-28110602

RESUMO

Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Eucariotos/genética , Proteínas/metabolismo , Animais , Apoptose/genética , Éxons , Humanos , Proteínas/genética , Software
16.
Oncol Rep ; 36(5): 2501-2510, 2016 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-27666315

RESUMO

Fine needle aspiration cytology (FNAC) is currently the method of choice for malignancy prediction in thyroid nodules. Nevertheless, in some cases the interpretation of FNAC results may be problematic due to limitations of the method. The expression level of some microRNAs changes with the development of thyroid tumors, and its quantitation can be used to refine the FNAC results. For this quantitation to be reliable, the obtained data must be adequately normalized. Currently, no reference genes are universally recognized for quantitative assessments of microRNAs in thyroid nodules. The aim of the present study was the selection and validation of such reference genes. Expression of 800 microRNAs in 5 paired samples of thyroid surgical material corresponding to different histotypes of tumors was analyzed using Nanostring technology and four of these (hsa-miR-151a-3p, -197-3p, -99a-5p and -214-3p) with the relatively low variation coefficient were selected. The possibility of use of the selected microRNAs and their combination as references was estimated by RT-qPCR on a sampling of cytological smears: benign (n=226), atypia of undetermined significance (n=9), suspicious for follicular neoplasm (n=61), suspicious for malignancy (n=19), medullary thyroid carcinoma (MTC) (n=32), papillary thyroid carcinoma (PTC) (n=54) and non-diagnostic material (ND) (n=34). In order to assess the expression stability of the references, geNorm algorithm was used. The maximum stability was observed for the normalization factor obtained by the combination of all 4 microRNAs. Further validation of the complex normalizer and individual selected microRNAs was performed using 5 different classification methods on 3 groups of FNAC smears from the analyzed batch: benign neoplasms, MTC and PTC. In all cases, the use of the complex classifier resulted in the reduced number of errors. On using the complex microRNA normalizer, the decision-tree method C4.5 makes it possible to distinguish between malignant and benign thyroid neoplasms in cytological smears with high overall accuracy (>91%).


Assuntos
Biomarcadores Tumorais/biossíntese , Carcinoma Neuroendócrino/diagnóstico , Carcinoma/diagnóstico , MicroRNAs/biossíntese , Neoplasias da Glândula Tireoide/diagnóstico , Biomarcadores Tumorais/genética , Biópsia por Agulha Fina , Carcinoma/genética , Carcinoma/patologia , Carcinoma Neuroendócrino/genética , Carcinoma Neuroendócrino/patologia , Carcinoma Papilar , Citodiagnóstico/métodos , Diagnóstico Diferencial , Regulação Neoplásica da Expressão Gênica , Humanos , MicroRNAs/genética , Neoplasias/diagnóstico , Neoplasias/genética , Neoplasias/patologia , Câncer Papilífero da Tireoide , Glândula Tireoide/metabolismo , Glândula Tireoide/patologia , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia
17.
Virus Res ; 218: 40-8, 2016 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-26673098

RESUMO

A study of the molecular genetics mechanisms of host-pathogen interactions is of paramount importance in developing drugs against viral diseases. Currently, the literature contains a huge amount of information that describes interactions between HCV and human proteins. In addition, there are many factual databases that contain experimentally verified data on HCV-host interactions. The sources of such data are the original data along with the data manually extracted from the literature. However, the manual analysis of scientific publications is time consuming and, because of this, databases created with such an approach often do not have complete information. One of the most promising methods to provide actualisation and completeness of information is text mining. Here, with the use of a previously developed method by the authors using ANDSystem, an automated extraction of information on the interactions between HCV and human proteins was conducted. As a data source for the text mining approach, PubMed abstracts and full text articles were used. Additionally, external factual databases were analyzed. On the basis of this analysis, a special version of ANDSystem, extended with the HCV interactome, was created. The HCV interactome contains information about the interactions between 969 human and 11 HCV proteins. Among the 969 proteins, 153 'new' proteins were found not previously referred to in any external databases of protein-protein interactions for HCV-host interactions. Thus, the extended ANDSystem possesses a more comprehensive detailing of HCV-host interactions versus other existing databases. It was interesting that HCV proteins more preferably interact with human proteins that were already involved in a large number of protein-protein interactions as well as those associated with many diseases. Among human proteins of the HCV interactome, there were a large number of proteins regulated by microRNAs. It turned out that the results obtained for protein-protein interactions and microRNA-regulation did not depend on how well the proteins were studied, while protein-disease interactions appeared to be dependent on the level of study. In particular, the mean number of diseases linked to well-studied proteins (proteins were considered well-studied if they were mentioned in 50 or more PubMed publications) from the HCV interactome was 20.8, significantly exceeding the mean number of associations with diseases (10.1) for the total set of well-studied human proteins present in ANDSystem. For proteins not highly poorly-studied investigated, proteins from the HCV interactome (each protein was referred to in less than 50 publications) distribution of the number of diseases associated with them had no statistically significant differences from the distribution of the number of diseases associated with poorly-studied proteins based on the total set of human proteins stored in ANDSystem. With this, the average number of associations with diseases for the HCV interactome and the total set of human proteins were 0.3 and 0.2, respectively. Thus, ANDSystem, extended with the HCV interactome, can be helpful in a wide range of issues related to analyzing HCV-host interactions in the search for anti-HCV drug targets. The demo version of the extended ANDSystem covered here containing only interactions between human proteins, genes, metabolites, diseases, miRNAs and molecular-genetic pathways, as well as interactions between human proteins/genes and HCV proteins, is freely available at the following web address: http://www-bionet.sscc.ru/psd/andhcv/.


Assuntos
Algoritmos , Mineração de Dados/métodos , Hepacivirus/genética , Hepatite C/genética , Receptores Virais/genética , Proteínas Virais/genética , Mineração de Dados/estatística & dados numéricos , Bases de Dados Factuais , Regulação da Expressão Gênica , Hepacivirus/metabolismo , Hepatite C/metabolismo , Hepatite C/virologia , Interações Hospedeiro-Patógeno , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , Complexos Multiproteicos/genética , Complexos Multiproteicos/metabolismo , Mapeamento de Interação de Proteínas , PubMed/estatística & dados numéricos , Receptores Virais/metabolismo , Proteínas Virais/metabolismo
18.
BMC Genomics ; 16 Suppl 13: S2, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26693737

RESUMO

BACKGROUND: Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. RESULTS: One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. CONCLUSIONS: These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.


Assuntos
Domínio Catalítico , Biologia Computacional , Éxons , Proteínas/genética , Proteínas/metabolismo , Aminoácidos/genética , Animais , Íntrons , Ligantes , Fases de Leitura Aberta
19.
BMC Syst Biol ; 9 Suppl 2: S2, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25881313

RESUMO

BACKGROUND: Sufficient knowledge of molecular and genetic interactions, which comprise the entire basis of the functioning of living systems, is one of the necessary requirements for successfully answering almost any research question in the field of biology and medicine. To date, more than 24 million scientific papers can be found in PubMed, with many of them containing descriptions of a wide range of biological processes. The analysis of such tremendous amounts of data requires the use of automated text-mining approaches. Although a handful of tools have recently been developed to meet this need, none of them provide error-free extraction of highly detailed information. RESULTS: The ANDSystem package was developed for the reconstruction and analysis of molecular genetic networks based on an automated text-mining technique. It provides a detailed description of the various types of interactions between genes, proteins, microRNA's, metabolites, cellular components, pathways and diseases, taking into account the specificity of cell lines and organisms. Although the accuracy of ANDSystem is comparable to other well known text-mining tools, such as Pathway Studio and STRING, it outperforms them in having the ability to identify an increased number of interaction types. CONCLUSION: The use of ANDSystem, in combination with Pathway Studio and STRING, can improve the quality of the automated reconstruction of molecular and genetic networks. ANDSystem should provide a useful tool for researchers working in a number of different fields, including biology, biotechnology, pharmacology and medicine.


Assuntos
Mineração de Dados/métodos , Redes Reguladoras de Genes , Software , PubMed , Biologia de Sistemas/métodos
20.
BMC Syst Biol ; 9 Suppl 2: S4, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25879409

RESUMO

BACKGROUND: Pre-eclampsia is the most common complication occurring during pregnancy. In the majority of cases, it is concurrent with other pathologies in a comorbid manner (frequent co-occurrences in patients), such as diabetes mellitus, gestational diabetes and obesity. Providing bronchial asthma, pulmonary tuberculosis, certain neurodegenerative diseases and cancers as examples, we have shown previously that pairs of inversely comorbid pathologies (rare co-occurrences in patients) are more closely related to each other at the molecular genetic level compared with randomly generated pairs of diseases. Data in the literature concerning the causes of pre-eclampsia are abundant. However, the key mechanisms triggering this disease that are initiated by other pathological processes are thus far unknown. The aim of this work was to analyse the characteristic features of genetic networks that describe interactions between comorbid diseases, using pre-eclampsia as a case in point. RESULTS: The use of ANDSystem, Pathway Studio and STRING computer tools based on text-mining and database-mining approaches allowed us to reconstruct associative networks, representing molecular genetic interactions between genes, associated concurrently with comorbid disease pairs, including pre-eclampsia, diabetes mellitus, gestational diabetes and obesity. It was found that these associative networks statistically differed in the number of genes and interactions between them from those built for randomly chosen pairs of diseases. The associative network connecting all four diseases was composed of 16 genes (PLAT, ADIPOQ, ADRB3, LEPR, HP, TGFB1, TNFA, INS, CRP, CSRP1, IGFBP1, MBL2, ACE, ESR1, SHBG, ADA). Such an analysis allowed us to reveal differential gene risk factors for these diseases, and to propose certain, most probable, theoretical mechanisms of pre-eclampsia development in pregnant women. The mechanisms may include the following pathways: [TGFB1 or TNFA]-[IL1B]-[pre-eclampsia]; [TNFA or INS]-[NOS3]-[pre-eclampsia]; [INS]-[HSPA4 or CLU]-[pre-eclampsia]; [ACE]-[MTHFR]-[pre-eclampsia]. CONCLUSIONS: For pre-eclampsia, diabetes mellitus, gestational diabetes and obesity, we showed that the size and connectivity of the associative molecular genetic networks, which describe interactions between comorbid diseases, statistically exceeded the size and connectivity of those built for randomly chosen pairs of diseases. Recently, we have shown a similar result for inversely comorbid diseases. This suggests that comorbid and inversely comorbid diseases have common features concerning structural organization of associative molecular genetic networks.


Assuntos
Redes Reguladoras de Genes , Pré-Eclâmpsia/genética , Comorbidade , Mineração de Dados , Complicações do Diabetes/genética , Complicações do Diabetes/patologia , Diabetes Gestacional/genética , Diabetes Gestacional/patologia , Feminino , Regulação da Expressão Gênica , Estudos de Associação Genética , Humanos , Obesidade/complicações , Obesidade/genética , Obesidade/patologia , Pré-Eclâmpsia/metabolismo , Pré-Eclâmpsia/patologia , Gravidez , Software , Biologia de Sistemas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...