Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Sensors (Basel) ; 23(24)2023 Dec 18.
Artículo en Inglés | MEDLINE | ID: mdl-38139735

RESUMEN

Participatory exposure research, which tracks behaviour and assesses exposure to stressors like air pollution, traditionally relies on time-activity diaries. This study introduces a novel approach, employing machine learning (ML) to empower laypersons in human activity recognition (HAR), aiming to reduce dependence on manual recording by leveraging data from wearable sensors. Recognising complex activities such as smoking and cooking presents unique challenges due to specific environmental conditions. In this research, we combined wearable environment/ambient and wrist-worn activity/biometric sensors for complex activity recognition in an urban stressor exposure study, measuring parameters like particulate matter concentrations, temperature, and humidity. Two groups, Group H (88 individuals) and Group M (18 individuals), wore the devices and manually logged their activities hourly and minutely, respectively. Prioritising accessibility and inclusivity, we selected three classification algorithms: k-nearest neighbours (IBk), decision trees (J48), and random forests (RF), based on: (1) proven efficacy in existing literature, (2) understandability and transparency for laypersons, (3) availability on user-friendly platforms like WEKA, and (4) efficiency on basic devices such as office laptops or smartphones. Accuracy improved with finer temporal resolution and detailed activity categories. However, when compared to other published human activity recognition research, our accuracy rates, particularly for less complex activities, were not as competitive. Misclassifications were higher for vague activities (resting, playing), while well-defined activities (smoking, cooking, running) had few errors. Including environmental sensor data increased accuracy for all activities, especially playing, smoking, and running. Future work should consider exploring other explainable algorithms available on diverse tools and platforms. Our findings underscore ML's potential in exposure studies, emphasising its adaptability and significance for laypersons while also highlighting areas for improvement.


Asunto(s)
Dispositivos Electrónicos Vestibles , Humanos , Salud Urbana , Investigación Participativa Basada en la Comunidad , Biometría , Poder Psicológico
2.
Ergonomics ; 66(8): 1164-1175, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-36269073

RESUMEN

The forefoot is the foot part most affected by ill-fitting shoes. Footwear fitting considers the measurements of length, width, and arch length. Toe shape has not yet been used in sizing feet and fitting shoes. This study aims to investigate the variation in toe shape, as measured by the hallux valgus angle. An automatic and reproducible hallux valgus angle measuring method using 3D foot scans with no palpation markers is proposed and applied to about half a million samples collected across North America, Europe, and Asia. The measuring method is robust and can detect the medial contour along the proximal phalanx even in extreme cases. The hallux valgus angle has a normal distribution with long tails on both sides in the general population. Large dispersions of HVA values were observed for both genders and in all three geographical regions. Practitioner summary: The hallux valgus angle has a broad distribution in the general population. Females have larger hallux valgus angles than males, and people from Asia have larger hallux valgus angles than people from North America and Europe. Shoe toe boxes should be designed to fit the actual shapes of shoppers' toes. The proposed method for measuring HVA opens a new opportunity to study the causal relationship between shoe wearing habits and HVA on a large scale.


Asunto(s)
Hallux Valgus , Humanos , Femenino , Masculino , Hallux Valgus/diagnóstico por imagen , Hallux Valgus/etiología , Pie/diagnóstico por imagen , Dedos del Pie , Europa (Continente) , Mano
3.
Sensors (Basel) ; 22(20)2022 Oct 19.
Artículo en Inglés | MEDLINE | ID: mdl-36298313

RESUMEN

Robot assembly tasks can fail due to unpredictable errors and can only continue with the manual intervention of a human operator. Recently, we proposed an exception strategy learning framework based on statistical learning and context determination, which can successfully resolve such situations. This paper deals with context determination from multimodal data, which is the key component of our framework. We propose a novel approach to generate unified low-dimensional context descriptions based on image and force-torque data. For this purpose, we combine a state-of-the-art neural network model for image segmentation and contact point estimation using force-torque measurements. An ensemble of decision trees is used to combine features from the two modalities. To validate the proposed approach, we have collected datasets of deliberately induced insertion failures both for the classic peg-in-hole insertion task and for an industrially relevant task of car starter assembly. We demonstrate that the proposed approach generates reliable low-dimensional descriptors, suitable as queries necessary in statistical learning.


Asunto(s)
Procesamiento de Imagen Asistido por Computador , Redes Neurales de la Computación , Humanos , Procesamiento de Imagen Asistido por Computador/métodos
4.
J Dairy Sci ; 102(11): 10639-10656, 2019 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-31447146

RESUMEN

Nutrient management on grazed grasslands is of critical importance to maintain productivity levels, as grass is the cheapest feed for ruminants and underpins these meat and milk production systems. Many attempts have been made to model the relationships between controllable (crop and soil fertility management) and noncontrollable influencing factors (weather, soil drainage) and nutrient/productivity levels. However, to the best of our knowledge not much research has been performed on modeling the interconnections between the influencing factors on one hand and nutrient uptake/herbage production on the other hand, by using data-driven modeling techniques. Our paper proposes to use predictive clustering trees (PCT) learned for building models on data from dairy farms in the Republic of Ireland. The PCT models show good accuracy in estimating herbage production and nutrient uptake. They are also interpretable and are found to embody knowledge that is in accordance with existing theoretical understanding of the task at hand. Moreover, if we combine more PCT into an ensemble of PCT (random forest of PCT), we can achieve improved accuracy of the estimates. In practical terms, the number of grazings, which is related proportionally with soil drainage class, is one of the most important factors that moderates the herbage production potential and nutrient uptake. Furthermore, we found the nutrient (N, P, and K) uptake and herbage nutrient concentration to be conservative in fields that had medium yield potential (11 t of dry matter per hectare on average), whereas nutrient uptake was more variable and potentially limiting in fields that had higher and lower herbage production. Our models also show that phosphorus is the most limiting nutrient for herbage production across the fields on these Irish dairy farms, followed by nitrogen and potassium.


Asunto(s)
Alimentación Animal , Bovinos/metabolismo , Industria Lechera/métodos , Aprendizaje Automático , Nutrientes/metabolismo , Alimentación Animal/análisis , Animales , Dieta/veterinaria , Femenino , Irlanda , Lactancia , Leche , Poaceae
5.
Int J Mol Sci ; 19(5)2018 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-29734672

RESUMEN

Glioblastoma (GB) is the most aggressive brain malignancy. Although some potential glioblastoma biomarkers have already been identified, there is a lack of cell membrane-bound biomarkers capable of distinguishing brain tissue from glioblastoma and/or glioblastoma stem cells (GSC), which are responsible for the rapid post-operative tumor reoccurrence. In order to find new GB/GSC marker candidates that would be cell surface proteins (CSP), we have performed meta-analysis of genome-scale mRNA expression data from three data repositories (GEO, ArrayExpress and GLIOMASdb). The search yielded ten appropriate datasets, and three (GSE4290/GDS1962, GSE23806/GDS3885, and GLIOMASdb) were used for selection of new GB/GSC marker candidates, while the other seven (GSE4412/GDS1975, GSE4412/GDS1976, E-GEOD-52009, E-GEOD-68848, E-GEOD-16011, E-GEOD-4536, and E-GEOD-74571) were used for bioinformatic validation. The selection identified four new CSP-encoding candidate genes—CD276, FREM2, SPRY1, and SLC47A1—and the bioinformatic validation confirmed these findings. A review of the literature revealed that CD276 is not a novel candidate, while SLC47A1 had lower validation test scores than the other new candidates and was therefore not considered for experimental validation. This validation revealed that the expression of FREM2—but not SPRY1—is higher in glioblastoma cell lines when compared to non-malignant astrocytes. In addition, FREM2 gene and protein expression levels are higher in GB stem-like cell lines than in conventional glioblastoma cell lines. FREM2 is thus proposed as a novel GB biomarker and a putative biomarker of glioblastoma stem cells. Both FREM2 and SPRY1 are expressed on the surface of the GB cells, while SPRY1 alone was found overexpressed in the cytosol of non-malignant astrocytes.


Asunto(s)
Biomarcadores de Tumor/genética , Proteínas de la Matriz Extracelular/genética , Glioblastoma/genética , Proteínas de la Membrana/genética , Fosfoproteínas/genética , Astrocitos/metabolismo , Línea Celular Tumoral , Regulación Neoplásica de la Expresión Génica , Glioblastoma/metabolismo , Glioblastoma/patología , Humanos , Células Madre Neoplásicas/metabolismo , Células Madre Neoplásicas/patología , Proteómica
6.
BMC Urol ; 16(1): 35, 2016 Jul 04.
Artículo en Inglés | MEDLINE | ID: mdl-27377958

RESUMEN

BACKGROUND: TMPRSS2:ERG gene aberration may be a novel marker that improves risk stratification of prostate cancer before definitive cancer therapy, but studies have been inconclusive. METHODS: The study cohort consisted of 202 operable prostate cancer Slovenian patients who underwent laparoscopic radical prostatectomy. We retrospectively constructed tissue microarrays of their prostatic specimens for fluorescence in situ hybridization, with appropriate signals obtained in 148 patients for subsequent statistical analyses. RESULTS: The following genetic aberrations were found: TMPRSS2:ERG fusion, TMPRSS2 split (a non-ERG translocation) and ERG split (an ERG translocation without involvement of TMPRSS2). TMPRSS2:ERG gene fusion happened in 63 patients (42 %), TMPRSS2 split in 12 patients and ERG split in 8 patients. Association was tested between TMPRSS2:ERG gene fusion and several clinicopathological variables, i.e., pT stage, extended lymph node dissection status, and Gleason score, correcting for multiple comparisons. Only the association with pT stage was significant at p = 0.05: Of 62 patients with pT3 stage, 34 (55 %) had TMPRSS2:ERG gene fusion. In pT3 stage patients, stronger (but not significant) association between eLND status and TMPRSS2:ERG gene fusion was detected. We detected TMPRSS2:ERG gene fusion in 64 % of the pT3 stage patients where we did not perform an extended lymph node dissection. CONCLUSIONS: Our results indicate that it is possible to predict pT3 stage at final histology from TMPRSS2:ERG gene fusion at initial core needle biopsy. FISH determination of TMPRSS2:ERG gene fusion may be particularly useful for patients scheduled to undergo a radical prostatectomy in order to improve oncological and functional results.


Asunto(s)
Fusión Génica , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/patología , Serina Endopeptidasas/genética , Anciano , Estudios de Cohortes , Humanos , Masculino , Persona de Mediana Edad , Estadificación de Neoplasias , Estudios Retrospectivos , Regulador Transcripcional ERG/genética
7.
PLoS Comput Biol ; 9(1): e1002852, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23308060

RESUMEN

New microbial genomes are sequenced at a high pace, allowing insight into the genetics of not only cultured microbes, but a wide range of metagenomic collections such as the human microbiome. To understand the deluge of genomic data we face, computational approaches for gene functional annotation are invaluable. We introduce a novel model for computational annotation that refines two established concepts: annotation based on homology and annotation based on phyletic profiling. The phyletic profiling-based model that includes both inferred orthologs and paralogs-homologs separated by a speciation and a duplication event, respectively-provides more annotations at the same average Precision than the model that includes only inferred orthologs. For experimental validation, we selected 38 poorly annotated Escherichia coli genes for which the model assigned one of three GO terms with high confidence: involvement in DNA repair, protein translation, or cell wall synthesis. Results of antibiotic stress survival assays on E. coli knockout mutants showed high agreement with our model's estimates of accuracy: out of 38 predictions obtained at the reported Precision of 60%, we confirmed 25 predictions, indicating that our confidence estimates can be used to make informed decisions on experimental validation. Our work will contribute to making experimental validation of computational predictions more approachable, both in cost and time. Our predictions for 998 prokaryotic genomes include ~400000 specific annotations with the estimated Precision of 90%, ~19000 of which are highly specific-e.g. "penicillin binding," "tRNA aminoacylation for protein translation," or "pathogenesis"-and are freely available at http://gorbi.irb.hr/.


Asunto(s)
Perfilación de la Expresión Génica , Filogenia , Escherichia coli/genética , Genes Bacterianos , Modelos Teóricos
8.
Cell Discov ; 10(1): 8, 2024 Jan 16.
Artículo en Inglés | MEDLINE | ID: mdl-38228615

RESUMEN

The regulation of protein function by external or internal signals is one of the key features of living organisms. The ability to directly control the function of a selected protein would represent a valuable tool for regulating biological processes. Here, we present a generally applicable regulation of proteins called INSRTR, based on inserting a peptide into a loop of a target protein that retains its function. We demonstrate the versatility and robustness of coiled-coil-mediated regulation, which enables designs for either inactivation or activation of selected protein functions, and implementation of two-input logic functions with rapid response in mammalian cells. The selection of insertion positions in tested proteins was facilitated by using a predictive machine learning model. We showcase the robustness of the INSRTR strategy on proteins with diverse folds and biological functions, including enzymes, signaling mediators, DNA binders, transcriptional regulators, reporters, and antibody domains implemented as chimeric antigen receptors in T cells. Our findings highlight the potential of INSRTR as a powerful tool for precise control of protein function, advancing our understanding of biological processes and developing biotechnological and therapeutic interventions.

9.
Antibiotics (Basel) ; 13(3)2024 Feb 23.
Artículo en Inglés | MEDLINE | ID: mdl-38534649

RESUMEN

The COVID-19 pandemic has strained healthcare systems globally. Shortages of hospital beds, reassignment of healthcare workers to COVID-19-dedicated wards, an increased workload, and evolving infection prevention and control measures have potentially contributed to the spread of multidrug-resistant bacteria (MDRB). To determine the impact of the COVID-19 pandemic at the University Medical Center Ljubljana, a tertiary teaching hospital, we analyzed the monthly incidence of select bacterial species per patient from 2018 to 2022. The analysis was performed for all isolates and for MDRB isolates. The data were analyzed separately for isolates from all clinical samples, from blood culture only, and from clinical and surveillance samples. Our findings revealed an increased incidence density of patients with Enterococcus faecium, Staphylococcus aureus, Escherichia coli, and Pseudomonas aeruginosa isolates from clinical samples during the COVID-19 period in the studied hospital. Notably, the incidence density of MDRB isolates-vancomycin-resistant E. faecium, extended-spectrum betalactamase-producing K. pneumoniae, and betalactam-resistant P. aeruginosa-from clinical samples increased during the COVID-19 period. There were no statistically significant differences in the incidence density of patients with blood culture MDRB isolates. We observed an increase in the overall MDRB burden (patients with MDRB isolates from both clinical and surveillance samples per 1000 patient days) in the COVID-19 period in the studied hospital for vancomycin-resistant E. faecium, carbapenem-resistant K. pneumoniae, and betalactam-resistant P. aeruginosa and a decrease in the methicillin-resistant S. aureus burden.

10.
BMC Bioinformatics ; 14: 285, 2013 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-24070402

RESUMEN

BACKGROUND: Ontologies and catalogs of gene functions, such as the Gene Ontology (GO) and MIPS-FUN, assume that functional classes are organized hierarchically, that is, general functions include more specific ones. This has recently motivated the development of several machine learning algorithms for gene function prediction that leverages on this hierarchical organization where instances may belong to multiple classes. In addition, it is possible to exploit relationships among examples, since it is plausible that related genes tend to share functional annotations. Although these relationships have been identified and extensively studied in the area of protein-protein interaction (PPI) networks, they have not received much attention in hierarchical and multi-class gene function prediction. Relations between genes introduce autocorrelation in functional annotations and violate the assumption that instances are independently and identically distributed (i.i.d.), which underlines most machine learning algorithms. Although the explicit consideration of these relations brings additional complexity to the learning process, we expect substantial benefits in predictive accuracy of learned classifiers. RESULTS: This article demonstrates the benefits (in terms of predictive accuracy) of considering autocorrelation in multi-class gene function prediction. We develop a tree-based algorithm for considering network autocorrelation in the setting of Hierarchical Multi-label Classification (HMC). We empirically evaluate the proposed algorithm, called NHMC (Network Hierarchical Multi-label Classification), on 12 yeast datasets using each of the MIPS-FUN and GO annotation schemes and exploiting 2 different PPI networks. The results clearly show that taking autocorrelation into account improves the predictive performance of the learned models for predicting gene function. CONCLUSIONS: Our newly developed method for HMC takes into account network information in the learning phase: When used for gene function prediction in the context of PPI networks, the explicit consideration of network autocorrelation increases the predictive performance of the learned models. Overall, we found that this holds for different gene features/ descriptions, functional annotation schemes, and PPI networks: Best results are achieved when the PPI network is dense and contains a large proportion of function-relevant interactions.


Asunto(s)
Biología Computacional/métodos , Anotación de Secuencia Molecular/métodos , Mapas de Interacción de Proteínas/genética , Algoritmos , Inteligencia Artificial , Bases de Datos Genéticas , Ontología de Genes , Proteínas de Saccharomyces cerevisiae/clasificación , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
11.
Data Brief ; 48: 109138, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-37128582

RESUMEN

In the field of environment and health studies, recent trends have focused on the identification of contaminants of emerging concern (CEC). This is a complex, challenging task, as resources, such as compound databases (DBs) and mass spectral libraries (MSLs) concerning these compounds are very poor. This is particularly true for semi polar organic contaminants that have to be derivatized prior to gas chromatography-mass spectrometry (GC-MS) analysis with electron impact ionization (EI), for which it is barely possible to find any records. In particular, there is a severe lack of datasets of GC-EI-MS spectra generated and made publicly available for the purpose of development, validation and performance evaluation of cheminformatics-assisted compound structure identification (CSI) approaches, including novel cutting-edge machine learning (ML)-based approaches [1]. We set out to fill this gap and support the machine learning-assisted compound identification, thus aiding cheminformatics-assisted identification of silylated derivatives in GC-MS laboratories working in the field of environment and health. To this end, we have generated 12 datasets of GC-EI-MS spectra, six of which contain GC-EI-MS spectra of trimethylsilyl (TMS) and six GC-EI-MS spectra of tert-butyldimethylsilyl (TBDMS) derivatives. Four of these datasets, named testing datasets, contain mass spectra acquired by the authors. They are available in full, together with corresponding metadata. Eight datasets, named training datasets, were derived from mass spectra in the NIST 17 Mass Spectral Library. For these, we have only made the metadata publicly available, due to licensing reasons. For each type of derivative, two testing datasets are generated by acquiring and processing GC-EI-MS spectra, such that they include raw and processed GC-EI-MS spectra of TMS and TBDMS derivatives of CECs, along with their corresponding metadata. The metadata contains IUPAC name, exact mass, molecular formula, InChI, InChIKey, SMILES and PubChemID, of each CEC and CEC-TMS or CEC-TBDMS derivative, where available. Eight GC-EI-MS training datasets are generated by using the National Institute of Standards and Technology (NIST)/U.S. Environmental Protection Agency (EPA)/National Institute of Health (NIH) 17 Mass Spectral Library. For each derivative type (TMS and TBDMS), four datasets are given, each corresponding to an original dataset obtained from NIST/EPA/NIH 17 and three variants thereof, obtained after each of the filtering steps of the procedure described below. Only the metadata about the training datasets are available, describing the corresponding NIST/EPA/NIH 17 entires: These include the compound name, CAS Registry number, InChIKey, exact mass, Mw, NIST number and ID number. The datasets we present here were used to train and test predictive models for identification of silylated derivatives built with ML approaches [4]. The models were built by using data curated from the NIST Mass Spectral Library 17 [2] and the machine learning approach of CSI:Output Kernel Regression (CSI:OKR) [2]. Data from the NIST Mass Spectral Library 17 are commercially available from the National Institute of Standards and Technology (NIST)/U.S. Environmental Protection Agency (EPA)/National Institute of Health (NIH) and thus cannot be made publicly available. This highlights the need for publicly available GC-EI-MS spectra, which we address by releasing in full the four testing datasets.

12.
Life (Basel) ; 13(2)2023 Jan 31.
Artículo en Inglés | MEDLINE | ID: mdl-36836747

RESUMEN

The utilization of a non-invasive electroencephalogram (EEG) as an input sensor is a common approach in the field of the brain-computer interfaces (BCI). However, the collected EEG data pose many challenges, one of which may be the age-related variability of event-related potentials (ERPs), which are often used as primary EEG BCI signal features. To assess the potential effects of aging, a sample of 27 young and 43 older healthy individuals participated in a visual oddball study, in which they passively viewed frequent stimuli among randomly occurring rare stimuli while being recorded with a 32-channel EEG set. Two types of EEG datasets were created to train the classifiers, one consisting of amplitude and spectral features in time and another with extracted time-independent statistical ERP features. Among the nine classifiers tested, linear classifiers performed best. Furthermore, we show that classification performance differs between dataset types. When temporal features were used, maximum individuals' performance scores were higher, had lower variance, and were less affected overall by within-class differences such as age. Finally, we found that the effect of aging on classification performance depends on the classifier and its internal feature ranking. Accordingly, performance will differ if the model favors features with large within-class differences. With this in mind, care must be taken in feature extraction and selection to find the correct features and consequently avoid potential age-related performance degradation in practice.

13.
Sci Data ; 10(1): 558, 2023 08 23.
Artículo en Inglés | MEDLINE | ID: mdl-37612295

RESUMEN

In our study, we set out to collect a multimodal annotated dataset for remote sensing of Maya archaeology, that is suitable for deep learning. The dataset covers the area around Chactún, one of the largest ancient Maya urban centres in the central Yucatán Peninsula. The dataset includes five types of data records: raster visualisations and canopy height model from airborne laser scanning (ALS) data, Sentinel-1 and Sentinel-2 satellite data, and manual data annotations. The manual annotations (used as binary masks) represent three different types of ancient Maya structures (class labels: buildings, platforms, and aguadas - artificial reservoirs) within the study area, their exact locations, and boundaries. The dataset is ready for use with machine learning, including convolutional neural networks (CNNs) for object recognition, object localization (detection), and semantic segmentation. We would like to provide this dataset to help more research teams develop their own computer vision models for investigations of Maya archaeology or improve existing ones.

14.
J Fungi (Basel) ; 9(11)2023 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-37998891

RESUMEN

Safe drinking water is a constant challenge due to global environmental changes and the rise of emerging pathogens-lately, these also include fungi. The fungal presence in water greatly varies between sampling locations. Little is known about fungi from water in combination with a selection of materials used in water distribution systems. Our research was focused on five water plants located in the Pannonian Plain, Slovenia. Sampled water originated from different natural water sources and was subjected to different cleaning methods before distribution. The average numbers of fungi from natural water, water after disinfection, water at the first sampling point in the water network, and water at the last sampling point were 260, 49, 64, and 97 CFU/L, respectively. Chlorination reduced the number of fungi by a factor of 5, but its effect decreased with the length of the water network. The occurrence of different fungi in water and on materials depended on the choice of material. The presence of the genera Aspergillus, Acremonium, Furcasterigmium, Gliomastix, and Sarocladium was mostly observed on cement, while Cadophora, Cladosporium, Cyphellophora, and Exophiala prevailed on metals. Plastic materials were more susceptible to colonization with basidiomycetous fungi. Opportunistically pathogenic fungi were isolated sporadically from materials and water and do not represent a significant health risk for water consumers. In addition to cultivation data, physico-chemical features of water were measured and later processed with machine learning methods, revealing the sampling location and water cleaning processes as the main factors affecting fungal presence and richness in water and materials in contact with water.

15.
Biosensors (Basel) ; 13(2)2023 Jan 26.
Artículo en Inglés | MEDLINE | ID: mdl-36831953

RESUMEN

Nowadays, morphology and molecular analyses at the single-cell level have a fundamental role in understanding biology better. These methods are utilized for cell phenotyping and in-depth studies of cellular processes, such as mitosis. Fluorescence microscopy and optical spectroscopy techniques, including Raman micro-spectroscopy, allow researchers to examine biological samples at the single-cell level in a non-destructive manner. Fluorescence microscopy can give detailed morphological information about the localization of stained molecules, while Raman microscopy can produce label-free images at the subcellular level; thus, it can reveal the spatial distribution of molecular fingerprints, even in live samples. Accordingly, the combination of correlative fluorescence and Raman microscopy (CFRM) offers a unique approach for studying cellular stages at the single-cell level. However, subcellular spectral maps are complex and challenging to interpret. Artificial intelligence (AI) may serve as a valuable solution to characterize the molecular backgrounds of phenotypes and biological processes by finding the characteristic patterns in spectral maps. The major contributions of the manuscript are: (I) it gives a comprehensive review of the literature focusing on AI techniques in Raman-based cellular phenotyping; (II) via the presentation of a case study, a new neural network-based approach is described, and the opportunities and limitations of AI, specifically deep learning, are discussed regarding the analysis of Raman spectroscopy data to classify mitotic cellular stages based on their spectral maps.


Asunto(s)
Inteligencia Artificial , Espectrometría Raman , Microscopía Fluorescente/métodos , Espectrometría Raman/métodos
16.
Front Microbiol ; 14: 1258670, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38029120

RESUMEN

Historically valuable canvas paintings are often exposed to conditions enabling microbial deterioration. Painting materials, mainly of organic origin, in combination with high humidity and other environmental conditions, favor microbial metabolism and growth. These preconditions are often present during exhibitions or storage in old buildings, such as churches and castles, and also in museum storage depositories. The accumulated dust serves as an inoculum for both indoor and outdoor fungi. In our study, we present the results on cultivable fungi isolated from 24 canvas paintings, mainly exhibited in Slovenian sacral buildings, dating from the 16th to 21st centuries. Fungi were isolated from the front and back of damaged and undamaged surfaces of the paintings using culture media with high- and low-water activity. A total of 465 isolates were identified using current taxonomic DNA markers and assigned to 37 genera and 98 species. The most abundant genus was Aspergillus, represented by 32 species, of which 9 xerophilic species are for the first time mentioned in contaminated paintings. In addition to the most abundant xerophilic A. vitricola, A. destruens, A. tardicrescens, and A. magnivesiculatus, xerophilic Wallemia muriae and W. canadensis, xerotolerant Penicillium chrysogenum, P. brevicompactum, P. corylophilum, and xerotolerant Cladosporium species were most frequent. When machine learning methods were used to predict the relationship between fungal contamination, damage to the painting, and the type of material present, proteins were identified as one of the most important factors and cracked paint was identified as a hotspot for fungal growth. Aspergillus species colonize paintings regardless of materials, while Wallemia spp. can be associated with animal fat. Culture media with low-water activity are suggested in such inventories to isolate and obtain an overview of fungi that are actively contaminating paintings stored indoors at low relative humidity.

17.
Comput Biol Med ; 141: 105001, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-34782112

RESUMEN

Many clinical studies follow patients over time and record the time until the occurrence of an event of interest (e.g., recovery, death, …). When patients drop out of the study or when their event did not happen before the study ended, the collected dataset is said to contain censored observations. Given the rise of personalized medicine, clinicians are often interested in accurate risk prediction models that predict, for unseen patients, a survival profile, including the expected time until the event. Survival analysis methods are used to detect associations or compare subpopulations of patients in this context. In this article, we propose to cast the time-to-event prediction task as a multi-target regression task, with censored observations modeled as partially labeled examples. We then apply semi-supervised learning to the resulting data representation. More specifically, we use semi-supervised predictive clustering trees and ensembles thereof. Empirical results over eleven real-life datasets demonstrate superior or equivalent predictive performance of the proposed approach as compared to three competitor methods. Moreover, smaller models are obtained compared to random survival forests, another tree ensemble method. Finally, we illustrate the informative feature selection mechanism of our method, by interpreting the splits induced by a single tree model when predicting survival for amyotrophic lateral sclerosis patients.


Asunto(s)
Aprendizaje Automático Supervisado , Análisis por Conglomerados , Humanos , Análisis Multivariante , Análisis de Supervivencia
18.
J Cheminform ; 14(1): 62, 2022 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-36109826

RESUMEN

MOTIVATION: Compound structure identification is using increasingly more sophisticated computational tools, among which machine learning tools are a recent addition that quickly gains in importance. These tools, of which the method titled Compound Structure Identification:Input Output Kernel Regression (CSI:IOKR) is an excellent example, have been used to elucidate compound structure from mass spectral (MS) data with significant accuracy, confidence and speed. They have, however, largely focused on data coming from liquid chromatography coupled to tandem mass spectrometry (LC-MS). Gas chromatography coupled to mass spectrometry (GC-MS) is an alternative which offers several advantages as compared to LC-MS, including higher data reproducibility. Of special importance is the substantial compound coverage offered by GC-MS, further expanded by derivatization procedures, such as silylation, which can improve the volatility, thermal stability and chromatographic peak shape of semi-volatile analytes. Despite these advantages and the increasing size of compound databases and MS libraries, GC-MS data have not yet been used by machine learning approaches to compound structure identification. RESULTS: This study presents a successful application of the CSI:IOKR machine learning method for the identification of environmental contaminants from GC-MS spectra. We use CSI:IOKR as an alternative to exhaustive search of MS libraries, independent of instrumental platform and data processing software. We use a comprehensive dataset of GC-MS spectra of trimethylsilyl derivatives and their molecular structures, derived from a large commercially available MS library, to train a model that maps between spectra and molecular structures. We test the learned model on a different dataset of GC-MS spectra of trimethylsilyl derivatives of environmental contaminants, generated in-house and made publicly available. The results show that 37% (resp. 50%) of the tested compounds are correctly ranked among the top 10 (resp. 20) candidate compounds suggested by the model. Even though spectral comparisons with reference standards or de novo structural elucidations are neccessary to validate the predictions, machine learning provides efficient candidate prioritization and reduction of the time spent for compound annotation.

19.
Sci Rep ; 12(1): 7267, 2022 05 04.
Artículo en Inglés | MEDLINE | ID: mdl-35508507

RESUMEN

Multilabel classification (MLC) is a machine learning task where the goal is to learn to label an example with multiple labels simultaneously. It receives increasing interest from the machine learning community, as evidenced by the increasing number of papers and methods that appear in the literature. Hence, ensuring proper, correct, robust, and trustworthy benchmarking is of utmost importance for the further development of the field. We believe that this can be achieved by adhering to the recently emerged data management standards, such as the FAIR (Findable, Accessible, Interoperable, and Reusable) and TRUST (Transparency, Responsibility, User focus, Sustainability, and Technology) principles. We introduce an ontology-based online catalogue of MLC datasets originating from various application domains following these principles. The catalogue extensively describes many MLC datasets with comprehensible meta-features, MLC-specific semantic descriptions, and different data provenance information. The MLC data catalogue is available at: http://semantichub.ijs.si/MLCdatasets .


Asunto(s)
Aprendizaje Automático , Semántica , Publicaciones
20.
J Fungi (Basel) ; 8(8)2022 Aug 16.
Artículo en Inglés | MEDLINE | ID: mdl-36012848

RESUMEN

Beach safety regulation is based on faecal indicators in water, leaving out sand and fungi, whose presence in both matrices has often been reported. To study the abundance, diversity and possible fluctuations of mycobiota, fungi from sand and seawater were isolated from the Portoroz beach (Slovenia) during a 1-year period. Sand analyses yielded 64 species of 43 genera, whereas seawater samples yielded 29 species of 18 genera. Environmental and taxonomical data of fungal communities were analysed using machine learning approaches. Changes in the air and water temperature, sunshine hours, humidity and precipitation, air pressure and wind speed appeared to affect mycobiota. The core genera Aphanoascus, Aspergillus, Fusarium, Bisifusarium, Penicillium, Talaromyces, and Rhizopus were found to compose a stable community within sand, although their presence and abundance fluctuated along with weather changes. Aspergillus spp. were the most abundant and thus tested against nine antimycotics using Sensititre Yeast One kit. Aspergillus niger and A. welwitschiae isolates were found to be resistant to amphotericin B. Additionally, four possible human pollution indicators were isolated during the bathing season, including Meyerozyma, which can be used in beach microbial regulation. Our findings provide the foundations for additional research on sand and seawater mycobiota and show the potential effect of global warming and extreme weather events on fungi in sand and sea.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA