RESUMO
Monitoring and predicting the environmental impact of wastewater is essential for sustainable aquaculture. The environmental DNA metabarcoding-integrated supervised machine learning (SML) algorithm is an alternative method for ecological quality assessment and prediction. However, the ecological integrity of aquaculture wastewater and available effective input features for prediction remain unclear. Here, we used the multispecies biological integrity index (Ms-IBI) to provide a detailed categorization, identifying over half of the samples (53.85 %) as highly impacted, emphasizing the urgency to address ecological degradation; Ms-IBI emerged as a reliable label for SML models. By condensing 410 effective indicators and integrating 25 core operational taxonomic unit features with external stressors, an accuracy of 0.78 and R2 of 0.96 was achieved. Utilizing only external stressors yielded a comparably good performance with fewer input features, obtaining an accuracy of 0.74 and an R2 of 0.91. The integration of external stressors in this study highlights a practical predictive method that meets the ecological quality requirements of aquaculture wastewater, aiding the reversal of global ecological decline.
RESUMO
At the plasma membrane, in response to biotic and abiotic cues, specific ligands initiate the formation of receptor kinase heterodimers, which regulate activities of plasma membrane proteins and initiate signaling cascades to the nucleus. In this study, we utilized affinity enrichment mass spectrometry (AE-MS) to investigate the stimulus-dependent interactomes of LRR receptor kinases in response to their respective ligands, with an emphasis on exploring structural influences and potential cross-talk events at the plasma membrane. BRI1 and SIRK1 were chosen as receptor kinases with distinct coreceptor preference. By using interactome characteristic of domain-swap chimera following a gradient boosting learning algorithm trained on SIRK1 and BRI1 interactomes, we attribute contributions of extracellular domain, transmembrane domain, juxtamembrane domain and kinase domain of respective ligand-binding receptors to their interaction with their coreceptors and substrates. Our results revealed juxtamembrane domain as major structural element defining the specific substrate recruitment for BRI1 and extracellular domain for SIRK1. Furthermore, the learning algrorithm enabled us to predict the phenotypic outcomes of chimeric receptors based on different domain combinations, which was verified by dedicated experiments. As a result, our work reveals a tightly controlled balance of signaling cascade activation dependent on ligand-binding receptors domains and the internal ligand status of the plant. Moreover, our study shows the robust utility of machine learning classification as a quantitative metric for studying dynamic interactomes, dissecting the contribution of specific domains and predicting their phenotypic outcome.
RESUMO
AIMS: Neuronal disorders have affected more than 15% of the world's population, signifying the importance of continued design and development of drugs that can cross the Blood-Brain Barrier (BBB). BACKGROUND: BBB limits the permeability of external compounds by 98% to maintain and regulate brain homeostasis. Hence, BBB permeability prediction is vital to predict the activity of a drug-like substance. OBJECTIVE: Here, we report about developing BBBper (Blood-Brain Barrier permeability prediction) using machine learning tool. METHOD: A supervised machine learning-based online tool, based on physicochemical parameters to predict the BBB permeability of given chemical compounds was developed. The user-end webpage was developed in HTML and linked with back-end server by a python script to run user queries and results. RESULT: BBBper uses a random forest algorithm at the back end, showing 97% accuracy on the external dataset, compared to 70-92% accuracy of currently available web-based BBB permeability prediction tools. CONCLUSION: The BBBper web tool is freely available at http://bbbper.mdu.ac.in.
RESUMO
BACKGROUND: Effective preventive interventions for PTSD rely on early identification of individuals at risk for developing PTSD. To establish early post-trauma who are at risk, there is a need for accurate prognostic risk screening instruments for PTSD that can be widely implemented in recently trauma-exposed adults. Achieving such accuracy and generalizability requires external validation of machine learning classification models. The current 2-ASAP cohort study will perform external validation on both full and minimal feature sets of supervised machine learning classification models assessing individual risk to follow an adverse PTSD symptom trajectory over the course of 1 year. We will derive these models from the TraumaTIPS cohort, separately for men and women. METHOD: The 2-ASAP longitudinal cohort will include N = 863 adults (N = 436 females, N = 427 males) who were recently exposed to acute civilian trauma. We will include civilian victims of accidents, crime and calamities at Victim Support Netherlands; and who were presented for medical evaluation of (suspected) traumatic injuries by emergency transportation to the emergency department. The baseline assessment within 2 months post-trauma will include self-report questionnaires on demographic, medical and traumatic event characteristics; potential risk and protective factors for PTSD; PTSD symptom severity and other adverse outcomes; and current best-practice PTSD screening instruments. Participants will be followed at 3, 6, 9, and 12 months post-trauma, assessing PTSD symptom severity and other adverse outcomes via self-report questionnaires. DISCUSSION: The ultimate goal of our study is to improve accurate screening and prevention for PTSD in recently trauma-exposed civilians. To enable future large-scale implementation, we will use self-report data to inform the prognostic models; and we will derive a minimal feature set of the classification models. This can be transformed into a short online screening instrument that is user-friendly for recently trauma-exposed adults to fill in. The eventual short online screening instrument will classify early post-trauma which adults are at risk for developing PTSD. Those at risk can be targeted and may subsequently benefit from preventive interventions, aiming to reduce PTSD and relatedly improve psychological, functional and economic outcomes.
Assuntos
Transtornos de Estresse Pós-Traumáticos , Humanos , Transtornos de Estresse Pós-Traumáticos/prevenção & controle , Transtornos de Estresse Pós-Traumáticos/diagnóstico , Estudos Longitudinais , Masculino , Feminino , Adulto , Estudos Prospectivos , Países Baixos , Programas de Rastreamento/métodos , Aprendizado de MáquinaRESUMO
It is essential to elucidate the molecular mechanisms underlying liver transplant tolerance and rejection. In cases of mouse liver transplantation between inbred strains, immunological rejection of the allograft is reduced with spontaneous apoptosis without immunosuppressive drugs, which differs from the actual clinical result. This may be because inbred strains are genetically homogeneous and less heterogeneous than others. We exploited outbred CD1 mice, which show highly heterogeneous genotypes among individuals, to search for biomarkers related to immune responses and to construct a model for predicting the outcome of liver allografting. Of the 36 mice examined, 18 died within 3 weeks after transplantation, while the others survived for more than 6 weeks. Whole-exome sequencing of the 36 donors revealed more than 9 million variants relative to the C57BL/6 J reference. We selected 6517 single-nucleotide and indel variants and performed machine learning to determine whether or not we could predict the prognosis of each genotype. Models were built by both deep learning with a one-dimensional convolutional neural network and linear classification and evaluated by leave-one-out cross-validation. Given that one short-lived mouse died early in an accident, the models perfectly predicted the outcome of all individuals, suggesting the importance of genotype collection. In addition, linear classification models provided a list of loci potentially responsible for these responses. The present methods as well as results is likely to be applicable to liver transplantation in humans.
Assuntos
Genótipo , Rejeição de Enxerto , Tolerância Imunológica , Transplante de Fígado , Aprendizado de Máquina Supervisionado , Animais , Camundongos , Tolerância Imunológica/genética , Rejeição de Enxerto/genética , Rejeição de Enxerto/imunologia , Fígado/imunologia , Fígado/patologia , Fígado/metabolismo , Camundongos Endogâmicos C57BL , Masculino , Sobrevivência de Enxerto/imunologia , Sobrevivência de Enxerto/genética , Animais não Endogâmicos , Sequenciamento do ExomaRESUMO
Color vision assessment is essential in clinical practice, yet different tests exhibit distinct strengths and limitations. Here we apply a psychophysical paradigm, Angular Indication Measurement (AIM) for color detection and discrimination. AIM is designed to address some of the shortcomings of existing tests, such as prolonged testing time, limited accuracy and sensitivity, and the necessity for clinician oversight. AIM presents adaptively generated charts, each a N×M (here 4×4) grid of stimuli, and participants are instructed to indicate either the orientation of the gap in a cone-isolating Landolt C optotype or the orientation of the edge between two colors in an equiluminant color space. The contrasts or color differences of the stimuli are adaptively selected for each chart based on performance of prior AIM charts. In a group of 23 color-normal and 15 people with color vision deficiency (CVD), we validate AIM color against Hardy-Rand-Rittler (HRR), Farnsworth-Munsell 100 hue test (FM100), and anomaloscope color matching diagnosis and use machine learning techniques to classify the type and severity of CVD. The results show that AIM has classification accuracies comparable to that of the anomaloscope, and while HRR and FM100 are less accurate than AIM and an anomaloscope, HRR is very rapid. We conclude that AIM is a computer-based, self-administered, response-adaptive and rapid tool with high test-retest repeatability that has the potential to be suitable for both clinical and research applications.
RESUMO
Recent research has shown that people who gamble are more likely to have suicidal thoughts and attempts compared to the general population. Despite the advancements made, no study to date has predicted suicide risk factors in people who gamble using machine learning algorithms. Therefore, current study aimed to identify the most critical predictors of suicidal ideation and suicidal attempts among people who gamble using a machine learning approach. An online survey conducted a cross-sectional analysis of 741 people who gamble (mean age: 25.9 ± 5.56). To predict the risk of suicide attempts and ideation, we employed a comprehensive set of 40 biological, psychological, social, and socio-demographic variables. The predictive models were developed using Logistic Regression, Random Forest (RF), robust eXtreme Gradient Boosting (XGBoost), and ensemble machine learning algorithms. Data analysis was performed using R-Studio software. Random Forest emerged as the top-performing algorithm for predicting suicidal ideation, with an impressive AUC of 0.934, sensitivity of 0.7514, specificity of 0.9885, PPV of 0.9473, and NPV of 0.9347. Across all models, dissociation, depression, and anxiety symptoms consistently emerged as crucial predictors of suicidal ideation. However, for suicide attempt prediction, all models exhibited weaker performance. XGBoost showed the best performance in this regard, with an AUC of 0.663, sensitivity of 0.78, specificity of 0.8990, PPV of 0.34, NPV of 0.984, and accuracy of 0.8918. Depressive symptoms and rumination severity were highlighted as the most important predictors of suicide attempts according to this model. These findings have important implications for clinical practice and public health interventions. Machine learning could help detect individuals prone to suicidal ideation and suicide attempts among people who gamble, assisting in creating tailored prevention programs to address future suicide risks more effectively.
RESUMO
The purpose of the research was to discover which variables better predict phonemic awareness. Socioeconomic status (SES), quality of parent-child interaction (PCI), screen time (DST), visual-spatial ability (VSA), and mathematical reasoning (MR) were included as independent variables in the model, while phonemic awareness (PA) was the dependent (outcome) variable. The research was designed as correlational research. A total of 556 first grade primary school students were recruited into the research sample upon approval by their parents. In the analytic procedures, supervised machine learning was adopted and data were analyzed through classification and regression trees (CART) by using rprart, rpart.plot, tidyverse, dplyr, ggplot2, and corrplot packages in R. Results of data analysis indicate that MR, PCI, and VSA can predict PA, while SES and DST are not predictors. Findings of the research were discussed along with intelligence theories and practical implications were noted for teachers and researchers.
RESUMO
Understanding and predicting the ecological status of urbanized rivers is crucial for their restoration and management. However, the complex and nonlinear nature of ecological responses poses a challenge to the development of predictive models. Here, the study investigated and predicted the status of eukaryotic plankton communities in urbanized rivers by coupling environmental DNA metabarcoding, the alternative stable states theory, and supervised machine learning (SML) models. The results revealed two distinct states of eukaryotic plankton communities under similar environmental conditions: one state was characterized by the enrichment of a diverse phytoplankton population and the high relative abundance of protozoa, whereas the alternative state was characterized by abundant phytoplankton and fungi with an associated risk of algal blooms. Turbidity was identified as a key driver based on the SML model and Mantel test. Potential analysis demonstrated that the response pattern of eukaryotic plankton communities to turbidity was thresholds with hysteresis (Threshold1 = 17 NTU, Threshold2 = 24 NTU). A reduction in turbidity induced a regime shift in the eukaryotic plankton community toward an alternative state associated with a risk of algal blooms. In the prediction of ecological status, both SML models showed excellent performance (R2 > 0.80, RMSE < 0.1, Kappa > 0.70). Additionally, SHapley Additive exPlanations analysis identified turbidity, chlorophyll-a, chemical oxygen demand (COD), ammonia nitrogen and green algae's amplicon sequence variants as crucial features for prediction, with turbidity and COD showing a synergistic effect on ecological status. A framework was further proposed to enhance the understanding and prediction of ecological status in urbanized rivers. The obtained results of this study demonstrated the feasibility of using SML models to predict and explain the ecological status of urbanized rivers with alternative stable states. This provides valuable insights for the application of SML models in the restoration and management of urbanized rivers.
Assuntos
Plâncton , Rios , Aprendizado de Máquina Supervisionado , Urbanização , Monitoramento Ambiental , FitoplânctonRESUMO
Two-spotted spider mite (Tetranychus urticae) is an important greenhouse pest. In cucumbers, heavy infestations lead to the complete loss of leaf assimilation surface, resulting in plant death. Symptoms caused by spider mite feeding alter the light reflection of leaves and could therefore be optically detected. Machine learning methods have already been employed to analyze spectral information in order to differentiate between healthy and spider mite-infested leaves of crops such as tomatoes or cotton. In this study, machine learning methods were applied to cucumbers. Hyperspectral data of leaves were recorded under controlled conditions. Effective wavelengths were identified using three feature selection methods. Subsequently, three supervised machine learning algorithms were used to classify healthy and spider mite-infested leaves. All combinations of feature selection and classification methods yielded accuracy of over 80%, even when using ten or five wavelengths. These results suggest that machine learning methods are a powerful tool for image-based detection of spider mites in cucumbers. In addition, due to the limited number of wavelengths, there is also substantial potential for practical application.
Assuntos
Cucumis sativus , Aprendizado de Máquina , Folhas de Planta , Tetranychidae , Animais , Tetranychidae/fisiologia , Tetranychidae/classificação , Cucumis sativus/parasitologia , Imageamento Hiperespectral/métodosRESUMO
High diversity seabed habitats, such as shellfish aggregations, play a significant role in marine ecosystem sustainability but are susceptible to bottom disturbance induced by anthropogenic activities. Regular monitoring of these habitats with effective mapping methods is therefore essential. Multibeam echosounder (MBES) has been widely used in recent decades for seabed characterization due to its non-destructive manner and extensive spatial coverage compared to traditional methods like bottom sampling. Nevertheless, bottom sampling remains essential to link ground truth with acoustic seabed classification. Using seabed samples and MBES measurements, machine learning techniques are commonly employed to model their relationships and generate classification maps of an extended seabed. However, limited ground truth data, resulting from constraints in regulations, budget, or time, may impede the development of robust machine learning models. To address this challenge, we applied a semi-supervised machine learning method to classify seabed sediments of a blue mussel (Mytilus edulis) cultivation area in the Oosterschelde, the Netherlands. We utilized nine boxcore samples to generate pseudo-labels on MBES data. These pseudo-labels enlarged the training data size, facilitated the training of three comprehensive machine learning algorithms (Gradient Boosting, Random Forest, and Support Vector Machine), and helped to classify the study site into mussel and non-mussel areas. We found the geomorphological and backscatter-related features to be complementary for mussel culture detection. Our classification results were demonstrated effective through expert knowledge of this cultivation area and brought insights for future research on natural mussel habitats.
Assuntos
Ecossistema , Animais , Monitoramento Ambiental/métodos , Aprendizado de Máquina Supervisionado , Países Baixos , Bivalves , Aprendizado de Máquina , Mytilus edulisRESUMO
BACKGROUND: Reduced bone density is recognized as a predictor for potential complications in reverse shoulder arthroplasty (RSA). While humeral and glenoid planning based on preoperative computed tomography (CT) scans assist in implant selection and position, reproducible methods for quantifying the patients' bone density are currently not available. The purpose of this study was to perform bone density analyses including patient specific calibration in an RSA cohort based on preoperative CT imaging. It was hypothesized that preoperative CT bone density measures would provide objective quantification of the patients' humeral bone quality. METHODS: This study consisted of three parts, (1) analysis of a patient-specific calibration method in cadaveric CT scans, (2) retrospective application in a clinical RSA cohort, and (3) clustering and classification with machine learning models. Forty cadaveric shoulders were scanned in a clinical CT and compared regarding calibration with density phantoms, air muscle, and fat (patient-specific) or standard Hounsfield unit. Post-scan patient-specific calibration was used to improve the extraction of three-dimensional regions of interest for retrospective bone density analysis in a clinical RSA cohort (n=345). Machine learning models were used to improve the clustering (Hierarchical Ward) and classification (Support Vector Machine (SVM)) of low bone densities in the respective patients. RESULTS: The patient-specific calibration method demonstrated improved accuracy with excellent intraclass correlation coefficients (ICC) for cylindrical cancellous bone densities (ICC>0.75). Clustering partitioned the training data set into a high-density subgroup consisting of 96 patients and a low-density subgroup consisting of 146 patients, showing significant differences between these groups. The SVM showed optimized prediction accuracy of low and high bone densities compared to conventional statistics in the training (accuracy=91.2%; AUC=0.967) and testing (accuracy=90.5 %; AUC=0.958) data set. CONCLUSION: Preoperative CT scans can be used to quantify the proximal humeral bone quality in patients undergoing RSA. The use of machine learning models and patient-specific calibration on bone mineral density demonstrated that multiple 3D bone density scores improved the accuracy of objective preoperative bone quality assessment. The trained model could provide preoperative information to surgeons treating patients with potentially poor bone quality.
RESUMO
India has been dealing with fluoride contamination of groundwater for the past few decades. Long-term exposure of fluoride can cause skeletal and dental fluorosis. Therefore, an in-depth exploration of fluoride concentrations in different parts of India is desirable. This work employs machine learning algorithms to analyze the fluoride concentrations in five major affected Indian states (Andhra Pradesh, Rajasthan, Tamil Nadu, Telangana and West Bengal). A correlation matrix was used to identify appropriate predictor variables for fluoride prediction. The various algorithms used for predictions included K-nearest neighbor (KNN), logistic regression (LR), random forest (RF), support vector classifier (SVC), Gaussian NB, MLP classifier, decision tree classifier, gradient boosting classifier, voting classifier soft and voting classifier hard. The performance of these models is assessed over accuracy, precision, recall and error rate and receiver operating curve. As the dataset was skewed, the performance of models was evaluated before and after resampling. Analysis of results indicates that the RF model is the best model for predicting fluoride contamination in groundwater in Indian states.
Assuntos
Fluoretos , Água Subterrânea , Poluentes Químicos da Água , Índia , Água Subterrânea/análise , Água Subterrânea/química , Fluoretos/análise , Poluentes Químicos da Água/análise , Aprendizado de Máquina Supervisionado , Monitoramento Ambiental/métodos , AlgoritmosRESUMO
BACKGROUND: Nowadays, social media plays a crucial role in disseminating information about cancer prevention and treatment. A growing body of research has focused on assessing access and communication effects of cancer information on social media. However, there remains a limited understanding of the comprehensive presentation of cancer prevention and treatment methods across social media platforms. Furthermore, research comparing the differences between medical social media (MSM) and common social media (CSM) is also lacking. OBJECTIVE: Using big data analytics, this study aims to comprehensively map the characteristics of cancer treatment and prevention information on MSM and CSM. This approach promises to enhance cancer coverage and assist patients in making informed treatment decisions. METHODS: We collected all posts (N=60,843) from 4 medical WeChat official accounts (accounts with professional medical backgrounds, classified as MSM in this paper) and 5 health and lifestyle WeChat official accounts (accounts with nonprofessional medical backgrounds, classified as CSM in this paper). We applied latent Dirichlet allocation topic modeling to extract cancer-related posts (N=8427) and identified 6 cancer themes separately in CSM and MSM. After manually labeling posts according to our codebook, we used a neural-based method for automated labeling. Specifically, we framed our task as a multilabel task and utilized different pretrained models, such as Bidirectional Encoder Representations from Transformers (BERT) and Global Vectors for Word Representation (GloVe), to learn document-level semantic representations for labeling. RESULTS: We analyzed a total of 4479 articles from MSM and 3948 articles from CSM related to cancer. Among these, 35.52% (2993/8427) contained prevention information and 44.43% (3744/8427) contained treatment information. Themes in CSM were predominantly related to lifestyle, whereas MSM focused more on medical aspects. The most frequently mentioned prevention measures were early screening and testing, healthy diet, and physical exercise. MSM mentioned vaccinations for cancer prevention more frequently compared with CSM. Both types of media provided limited coverage of radiation prevention (including sun protection) and breastfeeding. The most mentioned treatment measures were surgery, chemotherapy, and radiotherapy. Compared with MSM (1137/8427, 13.49%), CSM (2993/8427, 35.52%) focused more on prevention. CONCLUSIONS: The information about cancer prevention and treatment on social media revealed a lack of balance. The focus was primarily limited to a few aspects, indicating a need for broader coverage of prevention measures and treatments in social media. Additionally, the study's findings underscored the potential of applying machine learning to content analysis as a promising research approach for mapping key dimensions of cancer information on social media. These findings hold methodological and practical significance for future studies and health promotion.
Assuntos
Aprendizado de Máquina , Neoplasias , Mídias Sociais , Mídias Sociais/estatística & dados numéricos , Humanos , Neoplasias/prevenção & controle , Neoplasias/terapia , ChinaRESUMO
The ESKAPE family, comprising Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter spp., poses a significant global threat due to their heightened virulence and extensive antibiotic resistance. These pathogens contribute largely to the prevalence of nosocomial or hospital-acquired infections, resulting in high morbidity and mortality rates. To tackle this healthcare problem urgent measures are needed, including development of innovative vaccines and therapeutic strategies. Designing vaccines involves a complex and resource-intensive process of identifying protective antigens and potential vaccine candidates (PVCs) from pathogens. Reverse vaccinology (RV), an approach based on genomics, made this process more efficient by leveraging bioinformatics tools to identify potential vaccine candidates. In recent years, artificial intelligence and machine learning (ML) techniques has shown promise in enhancing the accuracy and efficiency of reverse vaccinology. This study introduces a supervised ML classification framework, to predict potential vaccine candidates specifically against ESKAPE pathogens. The model's training utilized biological and physicochemical properties from a dataset containing protective antigens and non-protective proteins of ESKAPE pathogens. Conventional autoencoders based strategy was employed for feature encoding and selection. During the training process, seven machine learning algorithms were trained and subjected to Stratified 5-fold Cross Validation. Random Forest and Logistic Regression exhibited best performance in various metrics including accuracy, precision, recall, WF1 score, and Area under the curve. An ensemble model was developed, to take collective strengths of both the algorithms. To assess efficacy of our final ensemble model, a high-quality benchmark dataset was employed. VacSol-ML(ESKAPE) demonstrated outstanding discrimination between protective vaccine candidates (PVCs) and non-protective antigens. VacSol-ML(ESKAPE), proves to be an invaluable tool in expediting vaccine development for these pathogens. Accessible to the public through both a web server and standalone version, it encourages collaborative research. The web-based and standalone tools are available at http://vacsolml.mgbio.tech/.
Assuntos
Antígenos de Bactérias , Vacinas Bacterianas , Aprendizado de Máquina , Antígenos de Bactérias/imunologia , Humanos , Vacinas Bacterianas/imunologia , Klebsiella pneumoniae/imunologia , Klebsiella pneumoniae/genética , Klebsiella pneumoniae/patogenicidade , Enterococcus faecium/imunologia , Enterococcus faecium/genética , Staphylococcus aureus/imunologia , Staphylococcus aureus/genética , Acinetobacter baumannii/imunologia , Pseudomonas aeruginosa/imunologia , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/patogenicidade , Biologia Computacional/métodos , Enterobacter/imunologia , Enterobacter/genética , Vacinologia/métodosRESUMO
Clinical prediction models serve as valuable instruments for assessing the risk of crucial outcomes and facilitating decision-making in clinical settings. Constructing these models requires nuanced analytical decisions and expertise informed by the current statistical literature. Access and thorough understanding of such literature may be limited for neurocritical care physicians, which may hinder the interpretation of existing predictive models. The present emphasis is on narrowing this knowledge gap by providing neurocritical care specialists with methodological guidance for interpreting predictive models in neurocritical care. Presented are the statistical learning principles integral to constructing a model predicting hospital mortality (nonsurvival during hospitalization) in patients with moderate and severe blunt traumatic brain injury using components of the IMPACT-Core model. Discussion encompasses critical elements such as model flexibility, hyperparameter selection, data imbalance, cross-validation, model assessment (discrimination and calibration), prediction instability, and probability thresholds. The intricate interplay among these components, the data set, and the clincal context of neurocritical care is elaborated. Leveraging this comprehensive exploration of statistical learning can enhance comprehension of articles encompassing model generation, tailored clinical care, and, ultimately, better interpretation and clinical applicability of predictive models.
RESUMO
BACKGROUND: Prevalence of suicidal behaviour increases rapidly in middle to late adolescence. Predicting suicide attempts across different ages would enhance our understanding of how suicidal behaviour manifests in this period of rapid development. This study aimed to develop separate models to predict suicide attempts within a cohort at middle and late adolescence. It also sought to examine differences between the models derived across both developmental stages. METHODS: This study used data from the nationally representative Longitudinal Study of Australian Children (N = 2266). We selected over 700 potential suicide attempt predictors measured via self-report questionnaires, and linked healthcare and education administrative datasets. Logistic regression, random forests, and gradient boosting algorithms were developed to predict suicide attempts across two stages (mid-adolescence: 14-15 years; late adolescence: 18-19 years) using predictors sampled two years prior (mid-adolescence: 12-13 years; late adolescence: 16-17 years). RESULTS: The late adolescence models (AUROC = 0.77-0.88, F1-score = 0.22-0.28, Sensitivity = 0.54-0.64) performed better than the mid-adolescence models (AUROC = 0.70-0.76, F1-score = 0.12-0.19, Sensitivity = 0.40-0.64). The most important features for predicting suicide attempts in mid-adolescence were mostly school-related, while the most important features in late adolescence included measures of prior suicidality, psychosocial health, and future plans. CONCLUSIONS: To date, this is the first study to use machine learning models to predict suicide attempts at different ages. Our findings suggest that the optimal suicide risk prediction model differs by stage of adolescence. Future research and interventions should consider that risk presentations can change rapidly during adolescence.
Assuntos
Aprendizado de Máquina , Tentativa de Suicídio , Humanos , Adolescente , Tentativa de Suicídio/estatística & dados numéricos , Feminino , Masculino , Estudos Longitudinais , Austrália/epidemiologia , Fatores Etários , Criança , Adulto Jovem , Fatores de Risco , Modelos LogísticosRESUMO
Objectives: Accelerating demand for patient messaging has impacted the practice of many providers. Messages are not recommended for urgent medical issues, but some do require rapid attention. This presents an opportunity for artificial intelligence (AI) methods to prioritize review of messages. Our study aimed to highlight some patient portal messages for prioritized review using a custom AI system integrated into the electronic health record (EHR). Materials and Methods: We developed a Bidirectional Encoder Representations from Transformers (BERT)-based large language model using 40 132 patient-sent messages to identify patterns involving high acuity topics that warrant an immediate callback. The model was then implemented into 2 shared pools of patient messages managed by dozens of registered nurses. A primary outcome, such as the time before messages were read, was evaluated with a difference-in-difference methodology. Results: Model validation on an expert-reviewed dataset (n = 7260) yielded very promising performance (C-statistic = 97%, average-precision = 72%). A binarized output (precision = 67%, sensitivity = 63%) was integrated into the EHR for 2 years. In a pre-post analysis (n = 396 466), an improvement exceeding the trend was observed in the time high-scoring messages sit unread (21 minutes, 63 vs 42 for messages sent outside business hours). Discussion: Our work shows great promise in improving care when AI is aligned with human workflow. Future work involves audience expansion, aiding users with suggested actions, and drafting responses. Conclusion: Many patients utilize patient portal messages, and while most messages are routine, a small fraction describe alarming symptoms. Our AI-based workflow shortens the turnaround time to get a trained clinician to review these messages to provide safer, higher-quality care.
RESUMO
The three Ground Reaction Force (GRF) components can be estimated using pressure insole sensors. In this paper, we compare the accuracy of estimating GRF components for both feet using six methods: three Deep Learning (DL) methods (Artificial Neural Network, Long Short-Term Memory, and Convolutional Neural Network) and three Supervised Machine Learning (SML) methods (Least Squares, Support Vector Regression, and Random Forest (RF)). Data were collected from nine subjects across six activities: normal and slow walking, static with and without carrying a load, and two Manual Material Handling activities. This study has two main contributions: first, the estimation of GRF components (Fx, Fy, and Fz) during the six activities, two of which have never been studied; second, the comparison of the accuracy of GRF component estimation between the six methods for each activity. RF provided the most accurate estimation for static situations, with mean RMSE values of RMSE_Fx = 1.65 N, RMSE_Fy = 1.35 N, and RMSE_Fz = 7.97 N for the mean absolute values measured by the force plate (reference) RMSE_Fx = 14.10 N, RMSE_Fy = 3.83 N, and RMSE_Fz = 397.45 N. In our study, we found that RF, an SML method, surpassed the experimented DL methods.
Assuntos
Aprendizado Profundo , Pressão , Aprendizado de Máquina Supervisionado , Humanos , Masculino , Caminhada/fisiologia , Redes Neurais de Computação , Sapatos , Adulto , Feminino , Pé/fisiologia , Fenômenos Biomecânicos/fisiologia , Adulto JovemRESUMO
Posttraumatic stress disorder (PTSD) is a chronic psychiatric condition that follows exposure to a traumatic stressor. Though previous in vivo proton (1H) MRS) research conducted at 4 T or lower has identified alterations in glutamate metabolism associated with PTSD predisposition and/or progression, no prior investigations have been conducted at higher field strength. In addition, earlier studies have not extensively addressed the impact of psychiatric comorbidities such as major depressive disorder (MDD) on PTSD-associated 1H-MRS-visible brain metabolite abnormalities. Here we employ 7 T 1H MRS to examine concentrations of glutamate, glutamine, GABA, and glutathione in the medial prefrontal cortex (mPFC) of PTSD patients with MDD (PTSD+MDD+; N = 6) or without MDD (PTSD+MDD-; N = 5), as well as trauma-unmatched controls without PTSD but with MDD (PTSD-MDD+; N = 9) or without MDD (PTSD-MDD-; N = 18). Participants with PTSD demonstrated decreased ratios of GABA to glutamine relative to healthy PTSD-MDD- controls but no single-metabolite abnormalities. When comorbid MDD was considered, however, MDD but not PTSD diagnosis was significantly associated with increased mPFC glutamine concentration and decreased glutamate:glutamine ratio. In addition, all participants with PTSD and/or MDD collectively demonstrated decreased glutathione relative to healthy PTSD-MDD- controls. Despite limited findings in single metabolites, patterns of abnormality in prefrontal metabolite concentrations among individuals with PTSD and/or MDD enabled supervised classification to separate them from healthy controls with 80+% sensitivity and specificity, with glutathione, glutamine, and myoinositol consistently among the most informative metabolites for this classification. Our findings indicate that MDD can be an important factor in mPFC glutamate metabolism abnormalities observed using 1H MRS in cohorts with PTSD.