Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 71
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
J Nucl Cardiol ; 38: 101889, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-38852900

RESUMEN

BACKGROUND: We developed an explainable deep-learning (DL)-based classifier to identify flow-limiting coronary artery disease (CAD) by O-15 H2O perfusion positron emission tomography computed tomography (PET/CT) and coronary CT angiography (CTA) imaging. The classifier uses polar map images with numerical data and visualizes data findings. METHODS: A DLmodel was implemented and evaluated on 138 individuals, consisting of a combined image-and data-based classifier considering 35 clinical, CTA, and PET variables. Data from invasive coronary angiography were used as reference. Performance was evaluated with clinical classification using accuracy (ACC), area under the receiver operating characteristic curve (AUC), F1 score (F1S), sensitivity (SEN), specificity (SPE), precision (PRE), net benefit, and Cohen's Kappa. Statistical testing was conducted using McNemar's test. RESULTS: The DL model had a median ACC = 0.8478, AUC = 0.8481, F1S = 0.8293, SEN = 0.8500, SPE = 0.8846, and PRE = 0.8500. Improved detection of true-positive and false-negative cases, increased net benefit in thresholds up to 34%, and comparable Cohen's kappa was seen, reaching similar performance to clinical reading. Statistical testing revealed no significant differences between DL model and clinical reading. CONCLUSIONS: The combined DL model is a feasible and an effective method in detection of CAD, allowing to highlight important data findings individually in interpretable manner.


Asunto(s)
Enfermedad de la Arteria Coronaria , Aprendizaje Profundo , Radioisótopos de Oxígeno , Tomografía Computarizada por Tomografía de Emisión de Positrones , Humanos , Masculino , Femenino , Persona de Mediana Edad , Enfermedad de la Arteria Coronaria/diagnóstico por imagen , Anciano , Tomografía Computarizada por Tomografía de Emisión de Positrones/métodos , Angiografía Coronaria/métodos , Imagen de Perfusión Miocárdica/métodos , Angiografía por Tomografía Computarizada/métodos , Isquemia Miocárdica/diagnóstico por imagen , Sensibilidad y Especificidad
2.
Methods ; 214: 35-45, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-37019293

RESUMEN

CONTEXT: Novel kinds of antibiotics are needed to combat the emergence of antibacterial resistance. Natural products (NPs) have shown potential as antibiotic candidates. Current experimental methods are not yet capable of exploring the massive, redundant, and noise-involved chemical space of NPs. In silico approaches are needed to select NPs as antibiotic candidates. OBJECTIVE: This study screens out NPs with antibacterial efficacy guided by both TCM and modern medicine and constructed a dataset aiming to serve the new antibiotic design. METHOD: A knowledge-based network is proposed in this study involving NPs, herbs, the concepts of TCM, and the treatment protocols (or etiologies) of infectious in modern medicine. Using this network, the NPs candidates are screened out and compose the dataset. Feature selection of machine learning approaches is conducted to evaluate the constructed dataset and statistically validate the im- portance of all NPs candidates for different antibiotics by a classification task. RESULTS: The extensive experiments prove the constructed dataset reaches a convincing classification performance with a 0.9421 weighted accuracy, 0.9324 recall, and 0.9409 precision. The further visu- alizations of sample importance prove the comprehensive evaluation for model interpretation based on medical value considerations.


Asunto(s)
Productos Biológicos , Medicina Tradicional China , Medicina Tradicional China/métodos , Productos Biológicos/farmacología
3.
Methods ; 209: 18-28, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36436760

RESUMEN

Sleep screening is an important tool for both healthcare and neuroscientific research. Automatic sleep scoring is an alternative to the time-consuming gold-standard manual scoring procedure. Recently there have seen promising results on automatic stage scoring by extracting spatio-temporal features via deep neural networks from electroencephalogram (EEG). However, such methods fail to consistently yield good performance due to a missing piece in data representation: the medical criterion of the sleep scoring task on top of EEG features. We argue that capturing stage-specific features that satisfy the criterion of sleep medicine is non-trivial for automatic sleep scoring. This paper considers two criteria: Transient stage marker and Overall profile of EEG features, then we propose a physiologically meaningful framework for sleep stage scoring via mixed deep neural networks. The framework consists of two sub-networks: feature extraction networks, constructed in consideration of the physiological characteristics of sleep, and an attention-based scoring decision network. Moreover, we quantize the framework for potential use under an IoT setting. For proof-of-concept, the performance of the proposed framework is demonstrated by introducing multiple sleep datasets with the largest comprising 42,560 h recorded from 5,793 subjects. From the experiment results, the proposed method achieves a competitive stage scoring performance, especially for Wake, N2, and N3, with higher F1 scores of 0.92, 0.86, and 0.88, respectively. Moreover, the feasibility analysis of framework quantization provides a potential for future implementation in the edge computing field and clinical settings.


Asunto(s)
Redes Neurales de la Computación , Sueño , Humanos , Fases del Sueño/fisiología , Electroencefalografía/métodos
4.
Skin Res Technol ; 28(3): 391-401, 2022 May.
Artículo en Inglés | MEDLINE | ID: mdl-34751451

RESUMEN

BACKGROUND: Intercellular lipids contain a lamellar structure that glows in polarized images. It could be expected that the intercellular lipid content be estimated from the luminance values calculated from polarized images of stratum corneum strips. Therefore, we attempted to develop a method for simple and rapid evaluation of the intercellular lipid content through a procedure. Herein, we demonstrated a relationship between the luminance value and the amount of ceramides, one of the main components of intercellular lipids. MATERIALS AND METHODS: The stratum corneum was collected from the forearm using slides with a pure rubber-based adhesive, which did not produce unnecessary luminescence under polarizing conditions. Images were analyzed using luminance indices. The positive secondary ion peak images were obtained using the time of flight-secondary ion mass spectrometry; the polarized and brightfield images were obtained using a polarized microscope. The ceramide and protein amount was measured by high-performance liquid chromatography and bicinchoninic acid protein assay after microscope imaging. Images and quantitative values were used to construct evaluation models based on a convolutional neural network (CNN). RESULTS: There was a correlation between the highlighted areas of the polarized image to overlap with the area where ceramide-derived peak was detected. Evaluation of the CNN-based model of the polarized images predicted the amount of ceramides per unit of stratum corneum. CONCLUSION: The method proposed in the study enabled a large number of specimens to provide a simple, rapid, and efficient evaluation of the intercellular lipid content.


Asunto(s)
Epidermis , Microscopía , Ceramidas/análisis , Cromatografía Líquida de Alta Presión , Epidermis/metabolismo , Humanos
5.
BMC Med Inform Decis Mak ; 21(1): 163, 2021 05 20.
Artículo en Inglés | MEDLINE | ID: mdl-34016115

RESUMEN

BACKGROUND: Sepsis is a severe illness that affects millions of people worldwide, and its early detection is critical for effective treatment outcomes. In recent years, researchers have used models to classify positive patients or identify the probability for sepsis using vital signs and other time-series variables as input. METHODS: In our study, we analyzed patients' conditions by their kinematics position, velocity, and acceleration, in a six-dimensional space defined by six vital signs. The patient is affected by the disease after a period if the position gets "near" to a calculated sepsis position in space. We imputed these kinematics features as explanatory variables of long short-term memory (LSTM), convolutional neural network (CNN) and linear neural network (LNN) and compared the prediction accuracies with only the vital signs as input. The dataset used contained information of approximately 4800 patients, each with 48 hourly registers. RESULTS: We demonstrated that the kinematics features models had an improved performance compared with vital signs models. The kinematics features model of LSTM achieved the best accuracy, 0.803, which was nine points higher than the vital signs model. Although with lesser accuracies, the kinematics features models of the CNN and LNN showed better performances than vital signs models. CONCLUSION: Applying our novel approach for early detection of sepsis using neural networks will prove to be an invaluable, more accurate method than considering only simple vital signs as input variables. We expect that other researchers with similar objectives can use the model presented in this innovative approach to improve their results.


Asunto(s)
Redes Neurales de la Computación , Sepsis , Fenómenos Biomecánicos , Diagnóstico Precoz , Humanos , Sepsis/diagnóstico , Signos Vitales
6.
BMC Genomics ; 21(1): 679, 2020 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-32998685

RESUMEN

BACKGROUND: Species of the genus Monascus are considered to be economically important and have been widely used in the production of yellow and red food colorants. In particular, three Monascus species, namely, M. pilosus, M. purpureus, and M. ruber, are used for food fermentation in the cuisine of East Asian countries such as China, Japan, and Korea. These species have also been utilized in the production of various kinds of natural pigments. However, there is a paucity of information on the genomes and secondary metabolites of these strains. Here, we report the genomic analysis and secondary metabolites produced by M. pilosus NBRC4520, M. purpureus NBRC4478 and M. ruber NBRC4483, which are NBRC standard strains. We believe that this report will lead to a better understanding of red yeast rice food. RESULTS: We examined the diversity of secondary metabolite production in three Monascus species (M. pilosus, M. purpureus, and M. ruber) at both the metabolome level by LCMS analysis and at the genome level. Specifically, M. pilosus NBRC4520, M. purpureus NBRC4478 and M. ruber NBRC4483 strains were used in this study. Illumina MiSeq 300 bp paired-end sequencing generated 17 million high-quality short reads in each species, corresponding to 200 times the genome size. We measured the pigments and their related metabolites using LCMS analysis. The colors in the liquid media corresponding to the pigments and their related metabolites produced by the three species were very different from each other. The gene clusters for secondary metabolite biosynthesis of the three Monascus species also diverged, confirming that M. pilosus and M. purpureus are chemotaxonomically different. M. ruber has similar biosynthetic and secondary metabolite gene clusters to M. pilosus. The comparison of secondary metabolites produced also revealed divergence in the three species. CONCLUSIONS: Our findings are important for improving the utilization of Monascus species in the food industry and industrial field. However, in view of food safety, we need to determine if the toxins produced by some Monascus strains exist in the genome or in the metabolome.


Asunto(s)
Genes de Plantas , Especiación Genética , Monascus/genética , Pigmentos Biológicos/genética , Metabolismo Secundario , Monascus/clasificación , Monascus/metabolismo , Familia de Multigenes , Filogenia , Pigmentos Biológicos/biosíntesis
7.
BMC Bioinformatics ; 20(1): 380, 2019 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-31288752

RESUMEN

BACKGROUND: Alkaloids, a class of organic compounds that contain nitrogen bases, are mainly synthesized as secondary metabolites in plants and fungi, and they have a wide range of bioactivities. Although there are thousands of compounds in this class, few of their biosynthesis pathways are fully identified. In this study, we constructed a model to predict their precursors based on a novel kind of neural network called the molecular graph convolutional neural network. Molecular similarity is a crucial metric in the analysis of qualitative structure-activity relationships. However, it is sometimes difficult for current fingerprint representations to emphasize specific features for the target problems efficiently. It is advantageous to allow the model to select the appropriate features according to data-driven decisions for extracting more useful information, which influences a classification or regression problem substantially. RESULTS: In this study, we applied a neural network architecture for undirected graph representation of molecules. By encoding a molecule as an abstract graph and applying "convolution" on the graph and training the weight of the neural network framework, the neural network can optimize feature selection for the training problem. By incorporating the effects from adjacent atoms recursively, graph convolutional neural networks can extract the features of latent atoms that represent chemical features of a molecule efficiently. In order to investigate alkaloid biosynthesis, we trained the network to distinguish the precursors of 566 alkaloids, which are almost all of the alkaloids whose biosynthesis pathways are known, and showed that the model could predict starting substances with an averaged accuracy of 97.5%. CONCLUSION: We have showed that our model can predict more accurately compared to the random forest and general neural network when the variables and fingerprints are not selected, while the performance is comparable when we carefully select 507 variables from 18000 dimensions of descriptors. The prediction of pathways contributes to understanding of alkaloid synthesis mechanisms and the application of graph based neural network models to similar problems in bioinformatics would therefore be beneficial. We applied our model to evaluate the precursors of biosynthesis of 12000 alkaloids found in various organisms and found power-low-like distribution.


Asunto(s)
Alcaloides/clasificación , Vías Biosintéticas , Redes Neurales de la Computación , Algoritmos , Alcaloides/química , Metaboloma , Modelos Teóricos
8.
Sensors (Basel) ; 19(7)2019 Apr 11.
Artículo en Inglés | MEDLINE | ID: mdl-30978955

RESUMEN

The further exploration of the capacitive ECG (cECG) is hindered by frequent fluctuations in signal quality from body movement and changes in sleep position. The processing framework must be fundamentally adapted to make full use of this signal. Therefore, we propose a new signal-processing framework that determines the signal quality for short signal segments (2 and 4 seconds) using a multi-class classification model (qua_model) based on a convolutional neural network (CNN). We built another independent deep CNN classifier (pos_model) to classify the sleep position. In the validation, 12 subjects were recruited for a 30-minute experiment, which required the subjects to lie on a bed in different sleeping positions. The short segments, classified as clear (C1 class) by the qua_model, were used to determine sleep positions with the pos_model. In 10-fold cross-validation, the qua_model for signals of 4-second length could recognize the signal of the C1 class at a 0.99 precision and a 0.99 recall; the pos_model could recognize the supine sleep position, the left, and right lateral sleep positions at a 0.99 averaged precision and a 0.99 averaged recall. Given the amount of data accumulated per night and the instability in the signal quality, this fully automatic processing framework is indispensable for a personal healthcare system. Therefore, this study could serve as an important step for cECG technique trying to explore the cECG for unconstrained heart monitoring.


Asunto(s)
Electrocardiografía , Redes Neurales de la Computación , Sueño/fisiología , Posición Supina/fisiología , Adulto , Algoritmos , Humanos , Masculino , Movimiento/fisiología , Posicionamiento del Paciente/métodos , Procesamiento de Señales Asistido por Computador , Adulto Joven
9.
BMC Bioinformatics ; 19(1): 264, 2018 07 13.
Artículo en Inglés | MEDLINE | ID: mdl-30005591

RESUMEN

BACKGROUND: There are different and complicated associations between genes and diseases. Finding the causal associations between genes and specific diseases is still challenging. In this work we present a method to predict novel associations of genes and pathways with inflammatory bowel disease (IBD) by integrating information of differential gene expression, protein-protein interaction and known disease genes related to IBD. RESULTS: We downloaded IBD gene expression data from NCBI's Gene Expression Omnibus, performed statistical analysis to determine differentially expressed genes, collected known IBD genes from DisGeNet database, which were used to construct a IBD related PPI network with HIPPIE database. We adapted our graph-based clustering algorithm DPClusO to cluster the disease PPI network. We evaluated the statistical significance of the identified clusters in the context of determining the richness of IBD genes using Fisher's exact test and predicted novel genes related to IBD. We showed 93.8% of our predictions are correct in the context of other databases and published literatures related to IBD. CONCLUSIONS: Finding disease-causing genes is necessary for developing drugs with synergistic effect targeting many genes simultaneously. Here we present an approach to identify novel disease genes and pathways and discuss our approach in the context of IBD. The approach can be generalized to find disease-associated genes for other diseases.


Asunto(s)
Redes Reguladoras de Genes , Enfermedades Inflamatorias del Intestino/genética , Algoritmos , Área Bajo la Curva , Bases de Datos Genéticas , Ontología de Genes , Humanos , Mapas de Interacción de Proteínas/genética , Curva ROC , Reproducibilidad de los Resultados
10.
J Biomed Inform ; 61: 194-202, 2016 06.
Artículo en Inglés | MEDLINE | ID: mdl-27064123

RESUMEN

Conventionally, workflows examining transcription regulation networks from gene expression data involve distinct analytical steps. There is a need for pipelines that unify data mining and inference deduction into a singular framework to enhance interpretation and hypotheses generation. We propose a workflow that merges network construction with gene expression data mining focusing on regulation processes in the context of transcription factor driven gene regulation. The pipeline implements pathway-based modularization of expression profiles into functional units to improve biological interpretation. The integrated workflow was implemented as a web application software (TransReguloNet) with functions that enable pathway visualization and comparison of transcription factor activity between sample conditions defined in the experimental design. The pipeline merges differential expression, network construction, pathway-based abstraction, clustering and visualization. The framework was applied in analysis of actual expression datasets related to lung, breast and prostrate cancer.


Asunto(s)
Minería de Datos , Regulación de la Expresión Génica , Programas Informáticos , Transcriptoma , Análisis por Conglomerados , Presentación de Datos , Humanos
11.
BMC Evol Biol ; 15: 180, 2015 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-26334309

RESUMEN

BACKGROUND: Bacterial cells have a remarkable ability to adapt to environmental changes, a phenomenon known as adaptive evolution. During adaptive evolution, phenotype and genotype dynamically changes; however, the relationship between these changes and associated constraints is yet to be fully elucidated. RESULTS: In this study, we analyzed phenotypic and genotypic changes in Escherichia coli cells during adaptive evolution to ethanol stress. Phenotypic changes were quantified by transcriptome and metabolome analyses and were similar among independently evolved ethanol tolerant populations, which indicate the existence of evolutionary constraints in the dynamics of adaptive evolution. Furthermore, the contribution of identified mutations in one of the tolerant strains was evaluated using site-directed mutagenesis. The result demonstrated that the introduction of all identified mutations cannot fully explain the observed tolerance in the tolerant strain. CONCLUSIONS: The results demonstrated that the convergence of adaptive phenotypic changes and diverse genotypic changes, which suggested that the phenotype-genotype mapping is complex. The integration of transcriptome and genome data provides a quantitative understanding of evolutionary constraints.


Asunto(s)
Evolución Biológica , Escherichia coli/genética , Etanol/farmacología , Escherichia coli/efectos de los fármacos , Perfilación de la Expresión Génica , Mutación , Fenotipo
12.
BMC Genomics ; 16: 802, 2015 Oct 16.
Artículo en Inglés | MEDLINE | ID: mdl-26474851

RESUMEN

BACKGROUND: Evolution optimizes a living system at both the genome and transcriptome levels. Few studies have investigated transcriptome evolution, whereas many studies have explored genome evolution in experimentally evolved cells. However, a comprehensive understanding of evolutionary mechanisms requires knowledge of how evolution shapes gene expression. Here, we analyzed Escherichia coli strains acquired during long-term thermal adaptive evolution. RESULTS: Evolved and ancestor Escherichia coli cells were exponentially grown under normal and high temperatures for subsequent transcriptome analysis. We found that both the ancestor and evolved cells had comparable magnitudes of transcriptional change in response to heat shock, although the evolutionary progression of their expression patterns during exponential growth was different at either normal or high temperatures. We also identified inverse transcriptional changes that were mediated by differences in growth temperatures and genotypes, as well as negative epistasis between genotype-and heat shock-induced transcriptional changes. Principal component analysis revealed that transcriptome evolution neither approached the responsive state at the high temperature nor returned to the steady state at the regular temperature. We propose that the molecular mechanisms of thermal adaptive evolution involve the optimization of steady-state transcriptomes at high temperatures without disturbing the heat shock response. CONCLUSIONS: Our results suggest that transcriptome evolution works to maintain steady-state gene expression during constrained differentiation at various evolutionary stages, while also maintaining responsiveness to environmental stimuli and transcriptome homeostasis.


Asunto(s)
Adaptación Fisiológica/genética , Escherichia coli/genética , Respuesta al Choque Térmico/genética , Transcriptoma/genética , Escherichia coli/fisiología , Evolución Molecular , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Genoma Bacteriano , Calor
13.
Plant Cell Physiol ; 56(5): 843-51, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25637373

RESUMEN

Curcuminoids, namely curcumin and its analogs, are secondary metabolites that act as the primary active constituents of turmeric (Curcuma longa). The contents of these curcuminoids vary among species in the genus Curcuma. For this reason, we compared two wild strains and two cultivars to understand the differences in the synthesis of curcuminoids. Because the fluxes of metabolic reactions depend on the amounts of their substrate and the activity of the catalysts, we analyzed the metabolite concentrations and gene expression of related enzymes. We developed a method based on RNA sequencing (RNA-Seq) analysis that focuses on a specific set of genes to detect expression differences between species in detail. We developed a 'selection-first' method for RNA-Seq analysis in which short reads are mapped to selected enzymes in the target biosynthetic pathways in order to reduce the effect of mapping errors. Using this method, we found that the difference in the contents of curcuminoids among the species, as measured by gas chromatography-mass spectrometry, could be explained by the changes in the expression of genes encoding diketide-CoA synthase, and curcumin synthase at the branching point of the curcuminoid biosynthesis pathway.


Asunto(s)
Vías Biosintéticas/genética , Curcuma/genética , Curcuma/metabolismo , Curcumina/metabolismo , Metabolómica/métodos , Análisis de Secuencia de ARN/métodos , Análisis por Conglomerados , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Redes y Vías Metabólicas/genética , Especificidad de la Especie , Transcriptoma/genética
14.
Plant Cell Physiol ; 55(1): e7, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24285751

RESUMEN

Databases (DBs) are required by various omics fields because the volume of molecular biology data is increasing rapidly. In this study, we provide instructions for users and describe the current status of our metabolite activity DB. To facilitate a comprehensive understanding of the interactions between the metabolites of organisms and the chemical-level contribution of metabolites to human health, we constructed a metabolite activity DB known as the KNApSAcK Metabolite Activity DB. It comprises 9,584 triplet relationships (metabolite-biological activity-target species), including 2,356 metabolites, 140 activity categories, 2,963 specific descriptions of biological activities and 778 target species. Approximately 46% of the activities described in the DB are related to chemical ecology, most of which are attributed to antimicrobial agents and plant growth regulators. The majority of the metabolites with antimicrobial activities are flavonoids and phenylpropanoids. The metabolites with plant growth regulatory effects include plant hormones. Over half of the DB contents are related to human health care and medicine. The five largest groups are toxins, anticancer agents, nervous system agents, cardiovascular agents and non-therapeutic agents, such as flavors and fragrances. The KNApSAcK Metabolite Activity DB is integrated within the KNApSAcK Family DBs to facilitate further systematized research in various omics fields, especially metabolomics, nutrigenomics and foodomics. The KNApSAcK Metabolite Activity DB could also be utilized for developing novel drugs and materials, as well as for identifying viable drug resources and other useful compounds.


Asunto(s)
Fenómenos Biológicos , Bases de Datos como Asunto , Metaboloma , Análisis por Conglomerados , Humanos , Estadística como Asunto
15.
J AOAC Int ; 2024 Aug 27.
Artículo en Inglés | MEDLINE | ID: mdl-39189970

RESUMEN

BACKGROUND: Reproducibility has been well-studied in the field of food analysis; the relative standard deviation is said to follow the Horwitz curve with certain exceptions. However, little systematic research has been done on predicting repeatability or intermediate precision. OBJECTIVE: We developed a regression method to estimate within-laboratory standard deviations using hierarchical Bayesian modeling and analyzing duplicate measurement data obtained from actual laboratory tests. METHODS: The Hamiltonian Monte Carlo method was employed and implemented using R with Stan. The basic structure of the statistical model was assumed to be a chi-squared distribution, the fixed effect of the predictor was assumed to be a nonlinear function with a constant term and a concentration-dependent term, and the random effects were assumed to follow a lognormal distribution as a hierarchical prior. RESULTS: By analyzing over 300 instances, we obtained regression results that fit well with the assumed model, except for moisture, which was a method-defined analyte. The developed method applies to a wide variety of analytes measured using general principles, including spectroscopy, GC, and HPLC. Although the estimated precisions were within the HorRat(r) criteria, some cases using high-sensitivity detectors, such as mass spectrometers, showed standard deviations below that range. CONCLUSION: We propose utilizing the within-laboratory precision predicted by the model established in this study for internal quality control and measurement uncertainty estimation without considering the sample matrices. HIGHLIGHTS: Performing statistical modeling on data from double analysis, which is conducted as a part of internal quality controls, will simplify the estimation of the precision that fits each analytical system in a laboratory.

16.
Microorganisms ; 12(8)2024 Jul 31.
Artículo en Inglés | MEDLINE | ID: mdl-39203410

RESUMEN

In case of future viral threats, including the proposed Disease X that has been discussed since the emergence of the COVID-19 pandemic in March 2020, our research has focused on the development of antiviral strategies using fragrance compounds with known antiviral activity. Despite the recognized antiviral properties of mixtures of certain fragrance compounds, there has been a lack of a systematic approach to optimize these mixtures. Confronted with the significant combinatorial challenge and the complexity of the compound formulation space, we employed Bayesian optimization, guided by Gaussian Process Regression (GPR), to systematically explore and identify formulations with demonstrable antiviral efficacy. This approach required the transformation of the characteristics of formulations into quantifiable feature values using molecular descriptors, subsequently modeling these data to predict and propose formulations with likely antiviral efficacy enhancements. The predicted formulations underwent experimental testing, resulting in the identification of combinations capable of inactivating 99.99% of viruses, including a notably efficacious formulation of five distinct fragrance types. This model demonstrates high predictive accuracy (coefficient determination Rcv2 > 0.7) and suggests a new frontier in antiviral strategy development. Our findings indicate the powerful potential of computational modeling to surpass human analytical capabilities in the pursuit of complex, fragrance-based antiviral formulations.

17.
PLOS Digit Health ; 3(3): e0000460, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38489375

RESUMEN

The purpose of this study is to demonstrate the use of a deep learning model in quantitatively evaluating clinical findings typically subject to uncertain evaluations by physicians, using binary test results based on routine protocols. A chest X-ray is the most commonly used diagnostic tool for the detection of a wide range of diseases and is generally performed as a part of regular medical checkups. However, when it comes to findings that can be classified as within the normal range but are not considered disease-related, the thresholds of physicians' findings can vary to some extent, therefore it is necessary to define a new evaluation method and quantify it. The implementation of such methods is difficult and expensive in terms of time and labor. In this study, a total of 83,005 chest X-ray images were used to diagnose the common findings of pleural thickening and scoliosis. A novel method for quantitatively evaluating the probability that a physician would judge the images to have these findings was established. The proposed method successfully quantified the variation in physicians' findings using a deep learning model trained only on binary annotation data. It was also demonstrated that the developed method could be applied to both transfer learning using convolutional neural networks for general image analysis and a newly learned deep learning model based on vector quantization variational autoencoders with high correlations ranging from 0.89 to 0.97.

18.
Plant Cell Physiol ; 54(2): e4, 2013 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-23292603

RESUMEN

Studies on plant metabolites have attracted significant attention in recent years. Over the past 8 years, we have constructed a unique metabolite database, called KNApSAcK, that contains information on the relationships between metabolites and their expressing organism(s). In the present paper, we introduce KNApSAcK-3D, which contains the three-dimensional (3D) structures of all of the metabolic compounds included in the original KNApSAcK database. The 3D structure for each compound was optimized using the Merck Molecular Force Field (MMFF94), and a multiobjective genetic algorithm was used to search extensively for possible conformations and locate the global minimum. The resulting set of structures may be used for docking studies to identify new and potentially unexpected binding sites for target proteins. The 3D structures may also be utilized for more qualitative studies, such as the estimation of biological activities using 3D-QSAR. The database can be accessed via a link from the KNApSAcK Family website (http://kanaya.naist.jp/KNApSAcK_Family/) or directory at http://kanaya.naist.jp/knapsack3d/.


Asunto(s)
Productos Biológicos/metabolismo , Bases de Datos de Compuestos Químicos , Metaboloma , Metabolómica/métodos , Plantas/metabolismo , Programas Informáticos , Algoritmos , Antiinfecciosos/metabolismo , Sitios de Unión , Internet , Conformación Molecular , Relación Estructura-Actividad Cuantitativa
19.
Plant Cell Physiol ; 54(5): 711-27, 2013 May.
Artículo en Inglés | MEDLINE | ID: mdl-23509110

RESUMEN

Biology is increasingly becoming a data-intensive science with the recent progress of the omics fields, e.g. genomics, transcriptomics, proteomics and metabolomics. The species-metabolite relationship database, KNApSAcK Core, has been widely utilized and cited in metabolomics research, and chronological analysis of that research work has helped to reveal recent trends in metabolomics research. To meet the needs of these trends, the KNApSAcK database has been extended by incorporating a secondary metabolic pathway database called Motorcycle DB. We examined the enzyme sequence diversity related to secondary metabolism by means of batch-learning self-organizing maps (BL-SOMs). Initially, we constructed a map by using a big data matrix consisting of the frequencies of all possible dipeptides in the protein sequence segments of plants and bacteria. The enzyme sequence diversity of the secondary metabolic pathways was examined by identifying clusters of segments associated with certain enzyme groups in the resulting map. The extent of diversity of 15 secondary metabolic enzyme groups is discussed. Data-intensive approaches such as BL-SOM applied to big data matrices are needed for systematizing protein sequences. Handling big data has become an inevitable part of biology.


Asunto(s)
Bases de Datos como Asunto , Proteínas de Plantas/química , Plantas/metabolismo , Metabolismo Secundario , Alcaloides/metabolismo , Transferasas Alquil y Aril/metabolismo , Secuencia de Aminoácidos , Sistema Enzimático del Citocromo P-450/metabolismo , Flavonoides/metabolismo , Metabolómica , Péptidos/química , Plantas/enzimología
20.
Plant Cell Physiol ; 54(5): 728-39, 2013 May.
Artículo en Inglés | MEDLINE | ID: mdl-23574698

RESUMEN

Metabolomics analysis tools can provide quantitative information on the concentration of metabolites in an organism. In this paper, we propose the minimum pathway model generator tool for simulating the dynamics of metabolite concentrations (SS-mPMG) and a tool for parameter estimation by genetic algorithm (SS-GA). SS-mPMG can extract a subsystem of the metabolic network from the genome-scale pathway maps to reduce the complexity of the simulation model and automatically construct a dynamic simulator to evaluate the experimentally observed behavior of metabolites. Using this tool, we show that stochastic simulation can reproduce experimentally observed dynamics of amino acid biosynthesis in Arabidopsis thaliana. In this simulation, SS-mPMG extracts the metabolic network subsystem from published databases. The parameters needed for the simulation are determined using a genetic algorithm to fit the simulation results to the experimental data. We expect that SS-mPMG and SS-GA will help researchers to create relevant metabolic networks and carry out simulations of metabolic reactions derived from metabolomics data.


Asunto(s)
Algoritmos , Arabidopsis/metabolismo , Simulación por Computador , Redes y Vías Metabólicas , Metabolómica , Cinética , Modelos Biológicos , Análisis de Componente Principal , Procesos Estocásticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA