Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 45
Filtrar
1.
Lupus ; 29(8): 954-963, 2020 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-32517571

RESUMO

BACKGROUND: Relapses and flares with delayed wound healing are among the main symptoms of systemic lupus erythematosus (SLE), a rheumatic autoimmune disease. The orientation of immune responses in SLE disease depends on the function of the population of macrophages. This study investigated the effect of indole-3-carbinol (I3C) on transcriptional profiling of macrophage-derived monocytes (MDMs) in four stages of the wound-healing process. METHODS: In the first phase of study, MDMs were generated from peripheral blood mononuclear cells of three new SLE cases (unmedicated) and two healthy controls. The cases and controls were then divided into I3C treated and untreated groups after 24 hours of exposure to I3C. Single-end RNA sequencing was performed using an Illumina NextSeq 500 platform. After comprehensive analysis among differentially expressed genes, CDKN1A, FN1 and MMP15 were validated by quantitative real-time polymerase chain reaction as upregulated ranked genes involved in wound-healing stages. RESULTS: The RNA sequencing analysis of treated cases and treated controls versus untreated cases and untreated controls (group 3 vs. group 4) revealed upregulation of various genes, for example: C1S, C1R, IGKV1-5, IGKV4-1, SERPING1, IGLC1 and IGLC2 in coagulation; ADAM19, CEACAM1 and CEACAM8 in M2 reprogramming; IRS1, FN1, THBS1 and LIMS2 in extracellular matrix organization; and STAT1, THBS1 and ATP2A3 in the proliferation stage of wound healing. CONCLUSIONS: The results showed that treatment with I3C could modulate the gene expression involved in wound healing in SLE cases and healthy controls.


Assuntos
Indóis/farmacologia , Lúpus Eritematoso Sistêmico/tratamento farmacológico , Lúpus Eritematoso Sistêmico/genética , Macrófagos/efeitos dos fármacos , Cicatrização/efeitos dos fármacos , Adulto , Estudos de Casos e Controles , Inibidor de Quinase Dependente de Ciclina p21/genética , Feminino , Fibronectinas/genética , Perfilação da Expressão Gênica , Humanos , Indóis/uso terapêutico , Macrófagos/metabolismo , Masculino , Metaloproteinase 15 da Matriz/genética , Pessoa de Meia-Idade , Análise de Sequência de RNA , Transdução de Sinais/efeitos dos fármacos , Regulação para Cima
2.
J Dairy Res ; 85(2): 193-200, 2018 May.
Artigo em Inglês | MEDLINE | ID: mdl-29785910

RESUMO

Sub-clinical mastitis (SCM) affects milk composition. In this study, we hypothesise that large-scale mining of milk composition features by pattern recognition models can identify the best predictors of SCM within the milk composition features. To this end, using data mining algorithms, we conducted a large-scale and longitudinal study to evaluate the ability of various milk production parameters as indicators of SCM. SCM is the most prevalent disease of dairy cattle, causing substantial economic loss for the dairy industry. Developing new techniques to diagnose SCM in its early stages improves herd health and is of great importance. Test-day Somatic Cell Count (SCC) is the most common indicator of SCM and the primary mastitis surveillance approach worldwide. However, test-day SCC fluctuates widely between days, causing major concerns for its reliability. Consequently, there would be great benefit to identifying additional efficient indicators from large-scale and longitudinal studies. With this intent, data was collected at every milking (twice per day) for a period of 2 months from a single farm using in-line electronic equipment (346 248 records in total). The following data were analysed: milk volume, protein concentration, lactose concentration, electrical conductivity (EC), milking time and peak flow. Three SCC cut-offs were used to estimate the prevalence of SCM: Australian ≥ 250 000 cells/ml, European ≥200 000 cells/ml and New Zealand ≥ 150 000 cells/ml. At first, 10 different Attribute Weighting Algorithms (AWM) were applied to the data. In the absence of SCC, lactose concentration featured as the most important variable, followed by EC. For the first time, using attribute weighted modelling, we showed that the concentration of lactose in milk can be used as a strong indicator of SCM. The development of machine-learning expert systems using two or more milk variables (such as lactose concentration and EC) may produce a predictive pattern for early SCM detection.


Assuntos
Condutividade Elétrica , Lactose/análise , Mastite Bovina/diagnóstico , Leite/química , Animais , Bovinos , Contagem de Células/veterinária , Indústria de Laticínios/instrumentação , Indústria de Laticínios/métodos , Sistemas Inteligentes , Feminino , Estudos Longitudinais , Aprendizado de Máquina , Proteínas do Leite/análise
3.
BMC Genomics ; 18(1): 627, 2017 Aug 17.
Artigo em Inglês | MEDLINE | ID: mdl-28814265

RESUMO

BACKGROUND: Pistachio (Pistacia vera L.) is one of the most important commercial nut crops worldwide. It is a salt-tolerant and long-lived tree, with the largest cultivation area in Iran. Climate change and subsequent increased soil salt content have adversely affected the pistachio yield in recent years. However, the lack of genomic/global transcriptomic sequences on P. vera impedes comprehensive researches at the molecular level. Hence, whole transcriptome sequencing is required to gain insight into functional genes and pathways in response to salt stress. RESULTS: RNA sequencing of a pooled sample representing 24 different tissues of two pistachio cultivars with contrasting salinity tolerance under control and salt treatment by Illumina Hiseq 2000 platform resulted in 368,953,262 clean 100 bp paired-ends reads (90 Gb). Following creating several assemblies and assessing their quality from multiple perspectives, we found that using the annotation-based metrics together with the length-based parameters allows an improved assessment of the transcriptome assembly quality, compared to the solely use of the length-based parameters. The generated assembly by Trinity was adopted for functional annotation and subsequent analyses. In total, 29,119 contigs annotated against all of five public databases, including NR, UniProt, TAIR10, KOG and InterProScan. Among 279 KEGG pathways supported by our assembly, we further examined the pathways involved in the plant hormone biosynthesis and signaling as well as those to be contributed to secondary metabolite biosynthesis due to their importance under salinity stress. In total, 11,337 SSRs were also identified, which the most abundant being dinucleotide repeats. Besides, 13,097 transcripts as candidate stress-responsive genes were identified. Expression of some of these genes experimentally validated through quantitative real-time PCR (qRT-PCR) that further confirmed the accuracy of the assembly. From this analysis, the contrasting expression pattern of NCED3 and SOS1 genes were observed between salt-sensitive and salt-tolerant cultivars. CONCLUSION: This study, as the first report on the whole transcriptome survey of P. vera, provides important resources and paves the way for functional and comparative genomic studies on this major tree to discover the salinity tolerance-related markers and stress response mechanisms for breeding of new pistachio cultivars with more salinity tolerance.


Assuntos
Perfilação da Expressão Gênica , Genômica , Pistacia/genética , Salinidade , Sequência Conservada , Flavonoides/biossíntese , Marcadores Genéticos/genética , Repetições de Microssatélites/genética , Anotação de Sequência Molecular , Pistacia/metabolismo , Pistacia/fisiologia , Reguladores de Crescimento de Plantas/genética , Estresse Fisiológico/genética , Fatores de Transcrição/genética
4.
Mol Biol Rep ; 43(9): 923-37, 2016 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-27324248

RESUMO

Diminished ovarian reserve (DOR) is one of the reasons for infertility that not only affects both older and young women. Ovarian reserve assessment can be used as a new prognostic tool for infertility treatment decision making. Here, up- and down-regulated gene expression profiles of granulosa cells were analysed to generate a putative interaction map of the involved genes. In addition, gene ontology (GO) analysis was used to get insight intol the biological processes and molecular functions of involved proteins in DOR. Eleven up-regulated genes and nine down-regulated genes were identified and assessed by constructing interaction networks based on their biological processes. PTGS2, CTGF, LHCGR, CITED, SOCS2, STAR and FSTL3 were the key nodes in the up-regulated networks, while the IGF2, AMH, GREM, and FOXC1 proteins were key in the down-regulated networks. MIRN101-1, MIRN153-1 and MIRN194-1 inhibited the expression of SOCS2, while CSH1 and BMP2 positively regulated IGF1 and IGF2. Ossification, ovarian follicle development, vasculogenesis, sequence-specific DNA binding transcription factor activity, and golgi apparatus are the major differential groups between up-regulated and down-regulated genes in DOR. Meta-analysis of publicly available transcriptomic data highlighted the high coexpression of CTGF, connective tissue growth factor, with the other key regulators of DOR. CTGF is involved in organ senescence and focal adhesion pathway according to GO analysis. These findings provide a comprehensive system biology based insight into the aetiology of DOR through network and gene ontology analyses.


Assuntos
Infertilidade Feminina/genética , Reserva Ovariana/genética , Feminino , Expressão Gênica , Ontologia Genética , Redes Reguladoras de Genes , Humanos , Infertilidade Feminina/metabolismo , Mapas de Interação de Proteínas
5.
J Theor Biol ; 368: 122-32, 2015 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-25591889

RESUMO

For the first time, prediction accuracies of some supervised and unsupervised algorithms were evaluated in an SSR-based DNA fingerprinting study of a pea collection containing 20 cultivars and 57 wild samples. In general, according to the 10 attribute weighting models, the SSR alleles of PEAPHTAP-2 and PSBLOX13.2-1 were the two most important attributes to generate discrimination among eight different species and subspecies of genus Pisum. In addition, K-Medoids unsupervised clustering run on Chi squared dataset exhibited the best prediction accuracy (83.12%), while the lowest accuracy (25.97%) gained as K-Means model ran on FCdb database. Irrespective of some fluctuations, the overall accuracies of tree induction models were significantly high for many algorithms, and the attributes PSBLOX13.2-3 and PEAPHTAP could successfully detach Pisum fulvum accessions and cultivars from the others when two selected decision trees were taken into account. Meanwhile, the other used supervised algorithms exhibited overall reliable accuracies, even though in some rare cases, they gave us low amounts of accuracies. Our results, altogether, demonstrate promising applications of both supervised and unsupervised algorithms to provide suitable data mining tools regarding accurate fingerprinting of different species and subspecies of genus Pisum, as a fundamental priority task in breeding programs of the crop.


Assuntos
Genes de Plantas , Modelos Genéticos , Pisum sativum/genética , Algoritmos , Teorema de Bayes , Análise por Conglomerados , Impressões Digitais de DNA/métodos , DNA de Plantas/genética , Árvores de Decisões , Marcadores Genéticos , Genótipo , Repetições de Microssatélites , Especificidade da Espécie , Máquina de Vetores de Suporte
6.
Mol Biol Rep ; 42(9): 1377-90, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26246405

RESUMO

Pandemic influenza remains as a substantial threat to humans with a widespread panic worldwide. In contrast, seasonal (non-pandemic) has a mild non-lethal infection each year. The underlying mechanisms governing the detrimental effects of pandemic influenza are yet to be known. Transcriptomic-based network discovery and gene ontology (GO) analysis of host response to pandemic influenza, compared to seasonal influenza, can shed light on the differential mechanisms which pandemic influenza is employed during evolution. Here, using microarray data of infected ferrets with pandemic and seasonal influenza (as a model), we evaluated the possible link between altered genes after pandemic infection with activation of neuronal disorders. To this end, we utilized novel computational biology techniques including differential transcriptome analysis, network construction, GO enrichment, and GO network to investigate the underlying mechanisms of pandemic influenza infection and host interaction. In comparison to seasonal influenza, pandemic influenza differentially altered the expression of 31 genes with direct involvement in activity of central nervous system (CNS). Network topology highlighted the high interactions of IRF1, NKX2-1 and NR5A1 as well as MIR27A, MIR19A, and MIR17. TGFB2, NCOA3 and SP1 were the central transcription factors in the networks. Pandemic influenza remarkably downregulated GPM6A and GTPase. GO network demonstrated the key roles of GPM6A and GTPase in regulation of important functions such as synapse assembly and neuron projection. For the first time, we showed that besides interference with cytokine/chemokine storm and neuraminidase enzyme, H1N1 pandemic influenza is able to directly affect neuronal gene networks. The possibility of application of some key regulators such as GPM6A protein, MIR128, and MIR367 as candidate therapeutic agents is discussed. The presented approach established a new way to unravel unknown pathways in virus-associated CNS dysfunction by utilizing global transcriptomic data, network and GO analysis.


Assuntos
Redes Reguladoras de Genes , Vírus da Influenza A Subtipo H1N1 , Proteínas do Tecido Nervoso/genética , Neurônios/metabolismo , Infecções por Orthomyxoviridae/genética , Animais , Biologia Computacional , Citocinas/metabolismo , Modelos Animais de Doenças , Regulação para Baixo , Furões , Perfilação da Expressão Gênica , Ontologia Genética , Neuraminidase/metabolismo , Infecções por Orthomyxoviridae/epidemiologia , Infecções por Orthomyxoviridae/metabolismo , Pandemias
7.
J Theor Biol ; 356: 213-22, 2014 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-24819464

RESUMO

Due to the central roles of lipid binding proteins (LBPs) in many biological processes, sequence based identification of LBPs is of great interest. The major challenge is that LBPs are diverse in sequence, structure, and function which results in low accuracy of sequence homology based methods. Therefore, there is a need for developing alternative functional prediction methods irrespective of sequence similarity. To identify LBPs from non-LBPs, the performances of support vector machine (SVM) and neural network were compared in this study. Comprehensive protein features and various techniques were employed to create datasets. Five-fold cross-validation (CV) and independent evaluation (IE) tests were used to assess the validity of the two methods. The results indicated that SVM outperforms neural network. SVM achieved 89.28% (CV) and 89.55% (IE) overall accuracy in identification of LBPs from non-LBPs and 92.06% (CV) and 92.90% (IE) (in average) for classification of different LBPs classes. Increasing the number and the range of extracted protein features as well as optimization of the SVM parameters significantly increased the efficiency of LBPs class prediction in comparison to the only previous report in this field. Altogether, the results showed that the SVM algorithm can be run on broad, computationally calculated protein features and offers a promising tool in detection of LBPs classes. The proposed approach has the potential to integrate and improve the common sequence alignment based methods.


Assuntos
Algoritmos , Proteínas de Transporte , Lipídeos , Modelos Genéticos , Redes Neurais de Computação , Análise de Sequência de Proteína/métodos , Máquina de Vetores de Suporte , Animais , Proteínas de Transporte/química , Proteínas de Transporte/classificação , Proteínas de Transporte/genética , Humanos , Ligação Proteica
8.
BMC Med Genomics ; 16(1): 219, 2023 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-37715225

RESUMO

BACKGROUND: The largest group of patients with breast cancer are estrogen receptor-positive (ER+) type. The estrogen receptor acts as a transcription factor and triggers cell proliferation and differentiation. Hence, investigating ER-DNA interaction genomic regions can help identify genes directly regulated by ER and understand the mechanism of ER action in cancer progression. METHODS: In the present study, we employed a workflow to do a meta-analysis of ChIP-seq data of ER+ cell lines stimulated with 10 nM and 100 nM of E2. All publicly available data sets were re-analyzed with the same platform. Then, the known and unknown batch effects were removed. Finally, the meta-analysis was performed to obtain meta-differentially bound sites in estrogen-treated MCF7 cell lines compared to vehicles (as control). Also, the meta-analysis results were compared with the results of T47D cell lines for more precision. Enrichment analyses were also employed to find the functional importance of common meta-differentially bound sites and associated genes among both cell lines. RESULTS: Remarkably, POU5F1B, ZNF662, ZNF442, KIN, ZNF410, and SGSM2 transcription factors were recognized in the meta-analysis but not in individual studies. Enrichment of the meta-differentially bound sites resulted in the candidacy of pathways not previously reported in breast cancer. PCGF2, HNF1B, and ZBED6 transcription factors were also predicted through the enrichment analysis of associated genes. In addition, comparing the meta-analysis results of both ChIP-seq and RNA-seq data showed that many transcription factors affected by ER were up-regulated. CONCLUSION: The meta-analysis of ChIP-seq data of estrogen-treated MCF7 cell line leads to the identification of new binding sites of ER that have not been previously reported. Also, enrichment of the meta-differentially bound sites and their associated genes revealed new terms and pathways involved in the development of breast cancer which should be examined in future in vitro and in vivo studies.


Assuntos
Neoplasias da Mama , Receptor alfa de Estrogênio , Humanos , Feminino , Receptor alfa de Estrogênio/genética , Neoplasias da Mama/genética , Receptores de Estrogênio , Sequenciamento de Cromatina por Imunoprecipitação , Transcriptoma , Genômica , Estrogênios
9.
Cogn Neurodyn ; 16(6): 1335-1349, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36408064

RESUMO

Accurate diagnosis of Attention-Deficit/Hyperactivity Disorder (ADHD) is a significant challenge. Misdiagnosis has significant negative medical side effects. Due to the complex nature of this disorder, there is no computational expert system for diagnosis. Recently, automatic diagnosis of ADHD by machine learning analysis of brain signals has received an increased attention. This paper aimed to achieve an accurate model to discriminate between ADHD patients and healthy controls by pattern discovery. Event-Related Potentials (ERP) data were collected from ADHD patients and healthy controls. After pre-processing, ERP signals were decomposed and features were calculated for different frequency bands. The classification was carried out based on each feature using seven machine learning algorithms. Important features were then selected and combined. To find specific patterns for each model, the classification was repeated using the proposed patterns. Results indicated that the combination of complementary features can significantly improve the performance of the predictive models. The newly developed features, defined based on band power, were able to provide the best classification using the Generalized Linear Model, Logistic Regression, and Deep Learning with the average accuracy and Receiver operating characteristic curve > %99.85 and > 0.999, respectively. High and low frequencies (Beta, Delta) performed better than the mid, frequencies in the discrimination of ADHD from control. Altogether, this study developed a machine learning expert system that minimises misdiagnosis of ADHD and is beneficial for the evaluation of treatment efficacy.

10.
Life (Basel) ; 12(7)2022 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-35888047

RESUMO

Morphology and feature selection are key approaches to address several issues in fisheries science and stock management, such as the hypothesis of admixture of Caspian common carp (Cyprinus carpio) and farmed carp stocks in Iran. The present study was performed to investigate the population classification of common carp in the southern Caspian basin using data mining algorithms to find the most important characteristic(s) differing between Iranian and farmed common carp. A total of 74 individuals were collected from three locations within the southern Caspian basin and from one farm between November 2015 and April 2016. A dataset of 26 traditional morphometric (TMM) attributes and a dataset of 14 geometric landmark points were constructed and then subjected to various machine learning methods. In general, the machine learning methods had a higher prediction rate with TMM datasets. The highest decision tree accuracy of 77% was obtained by rule and decision tree parallel algorithms, and "head height on eye area" was selected as the best marker to distinguish between wild and farmed common carp. Various machine learning algorithms were evaluated, and we found that the linear discriminant was the best method, with 81.1% accuracy. The results obtained from this novel approach indicate that Darwin's domestication syndrome is observed in common carp. Moreover, they pave the way for automated detection of farmed fish, which will be most beneficial to detect escapees and improve restocking programs.

11.
Comput Biol Med ; 134: 104471, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-34004573

RESUMO

SARS-COV-2, Severe Acute Respiratory Syndrome (SARS), and the Middle East respiratory syndrome-related coronavirus (MERS) viruses are from the coronaviridae family; the former became a global pandemic (with low mortality rate) while the latter were confined to a limited region (with high mortality rates). To investigate the possible structural differences at basic levels for the three viruses, genomic and proteomic sequences were downloaded and converted to polynomial datasets. Seven attribute weighting (feature selection) models were employed to find the key differences in their genome's nucleotide sequence. Most attribute weighting models selected the final nucleotide sequences (from 29,000th nucleotide positions to the end of the genome) as significantly different among the three virus classes. The genome and proteome sequences of this hot zone area (which corresponds to the 3'UTR region and encodes for nucleoprotein (N)) and Spike (S) protein sequences (as the most important viral protein) were converted into binary images and were analyzed by image processing techniques and Convolutional deep Neural Network (CNN). Although the predictive accuracy of CNN for Spike (S) proteins was low (0.48%), the machine-based learning algorithms were able to classify the three members of coronaviridae viruses with 100% accuracy based on 3'UTR region. For the first time ever, the relationship between the possible structural differences of coronaviruses at the sequential levels and their pathogenesis are being reported, which paves the road to deciphering the high pathogenicity of the SARS-COV-2 virus.


Assuntos
COVID-19 , Coronavírus da Síndrome Respiratória do Oriente Médio , Humanos , Coronavírus da Síndrome Respiratória do Oriente Médio/genética , Pandemias , Proteômica , SARS-CoV-2
12.
Comput Biol Med ; 138: 104893, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34598069

RESUMO

Understanding the underlying molecular mechanism of transporter activity is one of the major discussions in structural biology. A transporter can exclusively transport one ion (specific transporter) or multiple ions (general transporter). This study compared categorical and numerical features of general and specific calcium transporters using machine learning and attribute weighting models. To this end, 444 protein features, such as the frequency of dipeptides, organism, and subcellular location, were extracted for general (n = 103) and specific calcium transporters (n = 238). Aliphatic index, subcellular location, organism, Ile-Leu frequency, Glycine frequency, hydrophobic frequency, and specific dipeptides such as Ile-Leu, Phe-Val, and Tyr-Gln were the key features in differentiating general from specific calcium transporters. Calcium transporters in the cell outer membranes were specific, while the inner ones were general; additionally, when the hydrophobic frequency or Aliphatic index is increased, the calcium transporter act as a general transporter. Random Forest with accuracy criterion showed the highest accuracy (88.88% ±5.75%) and high AUC (0.964 ± 0.020), based on 5-fold cross-validation. Decision Tree with accuracy criterion was able to predict the specificity of calcium transporter irrespective of the organism and subcellular location. This study demonstrates the precise classification of transporter function based on sequence-derived physicochemical features.


Assuntos
Aprendizado de Máquina
13.
Environ Monit Assess ; 168(1-4): 575-85, 2010 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-19711190

RESUMO

Concentration of heavy metals in aquatic animals mainly occurs due to industrial contamination. In this study, the concentrations of four heavy metals (cadmium, lead, mercury, and arsenic) in organs of two cyprinid fish and in water collected from three sections of the Kor River, Iran were determined using the inductively coupled plasma method. Pathological and hormonal changes due to metal contamination were also measured. The concentrations of heavy metals in tissue of fish from the middle sampling zone were significantly higher (p < 0.05) than those from the other two sampling zones, whereas no significant differences (p > 0.05) were detected between the two sexes and species. High levels of metals were found in the ovaries and testes; estradiol in females and progesterone and testosterone in males from the middle study site were significantly (p < 0.05) lower than values from the other two sites. Pathological changes in blood cells, liver, and kidneys of fishes were significantly higher in highly polluted areas (middle sampling zone). These results show that industrial activities have polluted the river and that the maximum concentrations of Cd, Pb, and Hg were higher than the permissible levels for human consumption.


Assuntos
Cyprinidae/metabolismo , Monitoramento Ambiental , Metais Pesados/metabolismo , Poluentes Químicos da Água/metabolismo , Animais , Arsênio/metabolismo , Cádmio/metabolismo , Carpas/metabolismo , Feminino , Ferro , Chumbo/metabolismo , Masculino , Mercúrio/metabolismo , Ovário/metabolismo , Fatores Sexuais , Testículo/metabolismo
15.
Comput Biol Med ; 117: 103584, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-32072976

RESUMO

Different bioinformatic and data-mining approaches have been used for the analysis of proteins. Here, we describe a novel, robust, and reliable approach for comparative analysis of a large number of proteins by combining Image Processing Techniques and Convolutional Deep Neural Network (IPT-CNN). As proof of principle, we used IPT-CNN to predict different subtypes of Influenza A virus (IAV). Over 8000 sequences of surface proteins haemagglutinin (HA) and neuraminidase (NA) from different IAV subtypes were used to create polynomial or binary vector datasets. The datasets were then converted into binary images. Analysis of these images enabled the classification of IAV subtypes with 100% accuracy and, compared to non-image-based approaches, within a shorter time frame. The proteome-based IPT-CNN approach described here may be used for analysis and proteome-based classification of other proteins.


Assuntos
Processamento de Imagem Assistida por Computador , Redes Neurais de Computação , Algoritmos
16.
Infect Genet Evol ; 71: 224-231, 2019 07.
Artigo em Inglês | MEDLINE | ID: mdl-30953716

RESUMO

Plasmodium vivax, an intracellular protozoan, causes malaria which is characterized by fever, anemia, respiratory distress, liver and spleen enlargement. In spite of attempts to design an efficient vaccine, there is not a vaccine against P. vivax. Notable advances have recently achieved in the development of malaria vaccines targeting the surface antigens such as Apical Membrane Antigens (AMA)-1. AMA-1 is a micronemal protein synthesized during the erythrocyte-stage of Plasmodium species and plays a significant role in the invasion process of the parasite into host cells. P. vivax AMA-1 (PvAMA-1) can induce strong cellular and humoral responses, indicating that it can be an ideal candidate of vaccine against malaria. Identification and prediction of proteins characteristics increase our knowledge about them and leads to develop vaccine and diagnostic studies. In the present study several valid bioinformatics tools were applied to analyze the various characteristics of AMA-1 such as physical and chemical properties, secondary and tertiary structures, B- cell and T-cell prediction and other important features in order to introduce potential epitopes for designing a high-efficient vaccine. The results demonstrated that this protein had 57 potential PTM sites and only one transmembrane domain on its sequence. Also, multiple hydrophilic regions and classical high hydrophilic domains were predicted. Secondary structure prediction revealed that the proportions of random coil, alpha-helix and extended strand in the AMA-1 sequence were 53.74%, 27.22%, and 19.4%, respectively. Moreover, 5 disulfide bonds were predicted at positions 14-21aa, 162-192aa, 208-220aa, 247-265aa and 354-363aa. The data obtained from B-cell and T-cell epitopes prediction showed that there were several potential epitopes on AMA-1 that can be proper targets for diagnostic and vaccine studies. The current study presented interesting basic and theoretical information regarding PvAMA-1, being important for further studies in order to design a high-efficiency vaccine against malaria.


Assuntos
Antígenos de Protozoários/genética , Proteínas de Membrana/genética , Plasmodium vivax/genética , Proteínas de Protozoários/genética , Animais , Antígenos de Protozoários/imunologia , Biologia Computacional , Epitopos de Linfócito B/genética , Epitopos de Linfócito B/imunologia , Epitopos de Linfócito T/genética , Epitopos de Linfócito T/imunologia , Humanos , Malária Vivax/tratamento farmacológico , Malária Vivax/prevenção & controle , Proteínas de Membrana/imunologia , Plasmodium vivax/imunologia , Proteínas de Protozoários/imunologia , Vacinas/síntese química
17.
Comput Biol Med ; 114: 103456, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31605926

RESUMO

Sub-clinical bovine mastitis decreases milk quality and production. Moreover, sub-clinical mastitis leads to the use of antibiotics with consequent increased risk of the emergence of antibiotic-resistant bacteria. Therefore, early detection of infected cows is of great importance. The Somatic Cell Count (SCC) day-test used for mastitis surveillance, gives data that fluctuate widely between days, creating questions about its reliability and early prediction power. The recent identification of risk parameters of sub-clinical mastitis based on milking parameters by machine learning models is emerging as a promising new tool to enhance early prediction of mastitis occurrence. To develop the optimal approach for early sub-clinical mastitis prediction, we implemented 2 steps: (1) Finding the best statistical models to accurately link patterns of risk factors to sub-clinical mastitis, and (2) Extending this application from the farms tested to new farms (method generalization). Herein, we applied various machine learning-based prediction systems on a big milking dataset to uncover the best predictive models of sub-clinical mastitis. Data from 364,249 milking instances were collected by an electronic automated in-line monitoring system where milk volume, lactose concentration, electrical conductivity (EC), protein concentration, peak flow and milking time for each sample were measured. To provide a platform for the application of the models developed to other farms, the Z transformation approach was employed. Following this, various prediction systems [Deep Learning (DL), Naïve Bayes, Generalized Liner Model, Logistic Regression, Decision Tree, Gradient-Boosted Tree (GBT) and Random Forest] were applied to the non-transformed milking dataset and to a Z-standardized dataset. ROC (Receiver Operating Characteristics Curve), AUC (Area Under The Curve), and high accuracy demonstrated the high sensitivity of GBT and DL in detecting sub-clinical mastitis. GBT was the most accurate model (accuracy of 84.9%) in prediction of sub-clinical bovine mastitis. These data demonstrate how these models could be applied for prediction of sub-clinical mastitis in multiple bovine herds regardless of the size and sampling techniques.


Assuntos
Infecções Assintomáticas , Aprendizado Profundo , Diagnóstico por Computador/métodos , Mastite Bovina/diagnóstico , Animais , Bovinos , Árvores de Decisões , Diagnóstico Precoce , Feminino , Leite/química
18.
Gene ; 691: 114-124, 2019 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-30620887

RESUMO

Biosynthesis of secondary metabolites in plant is a complex process, regulated by many genes and influenced by several factors. In recent years, the next-generation sequencing (NGS) technology and advanced statistical analysis such as meta-analysis and computational systems biology have provided novel opportunities to overcome biological complexity. Here, we performed a meta-analysis on publicly available transcriptome datasets of twelve economically significant medicinal plants to identify differentially expressed genes (DEGs) between shoot and root tissues and to find the key molecular features which may be effective in the biosynthesis of secondary metabolites. Meta-analysis identified a total of 880 genes with differential expression between two tissues. Functional enrichment and KEGG pathway analysis indicated that the functions of those DEGs are highly associated with the developmental process, starch metabolic process, response to stimulus, porphyrin and chlorophyll metabolism, biosynthesis of secondary metabolites and phenylalanine metabolism. In addition, systems biology analysis of the DEGs was applied to find protein-protein interaction network and discovery of significant modules. The detected modules were associated with hormone signal transduction, transcription repressor activity, response to light stimulus and epigenetic processes. Finally, analysis was extended to search for putative miRNAs that are associated with DEGs. A total of 31 miRNAs were detected which belonged to 16 conserved families. The present study provides a comprehensive view to better understand the tissue-specific expression of genes and mechanisms involved in secondary metabolites synthesis and may provide candidate genes for future researches to improve yield of secondary metabolites.


Assuntos
Perfilação da Expressão Gênica/métodos , Marcadores Genéticos , Proteínas de Plantas/genética , Plantas Medicinais/genética , Regulação da Expressão Gênica de Plantas , Redes Reguladoras de Genes , MicroRNAs/genética , Especificidade de Órgãos , Mapas de Interação de Proteínas , Metabolismo Secundário , Análise de Sequência de RNA , Biologia de Sistemas
20.
J Parasit Dis ; 42(2): 269-276, 2018 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-29844632

RESUMO

Toxoplasma gondii, is a causative agent of morbidity and mortality in immunocompromised and congenitally-infected individuals. Attempts to construct DNA vaccines against T. gondii using surface proteins are increasing. The dense granule antigens are highly expressed in the acute and chronic phases of T. gondii infection and considered as suitable DNA vaccine candidates to control toxoplasmosis. In the present study, bioinformatics tools and online software were used to predict, analyze and compare the structural, physical and chemical characters and immunogenicity of the GRA-1, GRA-4, GRA-6 and GRA-7 proteins. Sequence alignment results indicated that the GRA-1, GRA-4, GRA-6 and GRA-7 proteins had low similarity. The secondary structure prediction demonstrated that among the four proteins, GRA-1 and GRA-6 had similar secondary structure except for a little discrepancy. Hydrophilicity/hydrophobicity analysis showed multiple hydrophilic regions and some classical high hydrophilic domains for each protein sequence. Immunogenic epitope prediction results demonstrated that the GRA-1 and GRA-4 epitopes were stable and GRA-4 showed the highest degree of antigenicity. Although the GRA-7 epitope had the highest score of immunogenicity, this epitope was instable and had the lowest degree of antigenicity and half-time in eukaryotic cell. Also, the results indicated that GRA4-GRA7 epitope and GRA6-GRA7 had the highest degree of antigenicity and immunogenicity among multi-hybrid epitopes, respectively. Totally, in the present study, single epitopes showed the highest degree of antigenicity compared with multi-hybrid epitopes. Given the results, it can be concluded that GRA-4 and GRA-7 can be powerful DNA vaccine candidates against T. gondii.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa