Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 231
Filtrar
1.
Sci Prog ; 107(4): 368504241293008, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39492190

RESUMO

Purpose: This study aims to develop a predictive model for epidermal growth factor receptor (EGFR) mutations in lung adenocarcinoma by integrating computed tomography (CT) imaging features with clinical characteristics. Methods: A retrospective analysis was conducted using electronic medical records from 194 patients diagnosed with lung adenocarcinoma between January 2016 and December 2020, with approval from the institutional review board. Features were selected using LASSO regression, and predictive models were built using logistic regression, support vector machine, and random forest methods. Individual models were created for clinical features, CT imaging features, and a combined model to predict EGFR mutations. Results: The training set revealed that alcohol consumption, intrapulmonary metastasis, and pleural effusion were statistically significant in distinguishing between wild-type and mutation groups (p < 0.05). In the testing set, hilar and mediastinal lymphadenopathy showed statistical significance (p < 0.05). The combined model outperformed the individual clinical and CT imaging feature models. In the testing set, the logistic regression model achieved the highest AUC of 0.827, with sensitivity, specificity, and accuracy of 0.714, 0.712, and 0.712, respectively. Nomogram analysis identified lobulation as an important feature, with a predicted probability of up to 0.9. The decision curve analysis showed that the CT imaging feature model provided a higher net benefit compared to both the clinical feature model and the combined model. Conclusion: In summary, while the combined model outperformed the individual feature models in the testing set, the CT imaging feature model demonstrated the greatest clinical net benefit. Lobulation was identified as an important predictor of EGFR mutations in lung adenocarcinoma.


Assuntos
Adenocarcinoma de Pulmão , Receptores ErbB , Neoplasias Pulmonares , Mutação , Tomografia Computadorizada por Raios X , Humanos , Receptores ErbB/genética , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/diagnóstico por imagem , Adenocarcinoma de Pulmão/patologia , Tomografia Computadorizada por Raios X/métodos , Masculino , Feminino , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/diagnóstico por imagem , Neoplasias Pulmonares/patologia , Pessoa de Meia-Idade , Estudos Retrospectivos , Idoso , Nomogramas , Adulto
2.
Sci Rep ; 14(1): 25359, 2024 10 25.
Artigo em Inglês | MEDLINE | ID: mdl-39455658

RESUMO

This study aimed to develop a machine learning (ML) model for predicting pulmonary embolism (PE) in patients with gastrointestinal cancers, a group at increased risk for PE. We conducted a retrospective, multicenter study analyzing patients who underwent computed tomographic pulmonary angiography (CTPA) between 2010 and 2020. The study utilized demographic and clinical data, including the Wells score and D-dimer levels, to train a random forest ML model. The model's effectiveness was assessed using the area under the receiver operating curve (AUROC). In total, 446 patients from hospital A and 139 from hospital B were included. The training set consisted of 356 patients from hospital A, with internal validation on 90 and external validation on 139 patients from hospital B. The model achieved an AUROC of 0.736 in hospital A and 0.669 in hospital B. The ML model significantly reduced the number of patients recommended for CTPA compared to the conventional diagnostic strategy (hospital A; 100.0% vs. 91.1%, P < 0.001, hospital B; 100.0% vs. 93.5%, P = 0.003). The results indicate that an ML-based prediction model can reduce unnecessary CTPA procedures in gastrointestinal cancer patients, highlighting its potential to enhance diagnostic efficiency and reduce patient burden.


Assuntos
Neoplasias Gastrointestinais , Aprendizado de Máquina , Embolia Pulmonar , Humanos , Embolia Pulmonar/diagnóstico por imagem , Masculino , Feminino , Estudos Retrospectivos , Pessoa de Meia-Idade , Idoso , Neoplasias Gastrointestinais/diagnóstico por imagem , Tomografia Computadorizada por Raios X/métodos , Angiografia por Tomografia Computadorizada/métodos , Procedimentos Desnecessários/estatística & dados numéricos , Curva ROC
3.
Environ Sci Technol ; 58(44): 19843-19850, 2024 Nov 05.
Artigo em Inglês | MEDLINE | ID: mdl-39443164

RESUMO

Per-and polyfluoroalkyl substances (PFAS) are synthetic chemicals that are increasingly being detected in groundwater. The negative health consequences associated with human exposure to PFAS make it essential to quantify the distribution of PFAS in groundwater systems. Mapping PFAS distributions is particularly challenging because a national patchwork of testing and reporting requirements has resulted in sparse and spatially biased data. In this analysis, an inhomogeneous Poisson process (IPP) modeling approach is adopted from ecological statistics to continuously map PFAS distributions in groundwater across the contiguous United States. The model is trained on a unique data set of 8910 PFAS groundwater measurements, using combined concentrations of two PFAS analytes. The IPP model predictions are compared with results from random forest models to highlight the robustness of this statistical modeling approach on sparse data sets. This analysis provides a new approach to not only map PFAS contamination in groundwater but also prioritize future sampling efforts.


Assuntos
Monitoramento Ambiental , Água Subterrânea , Poluentes Químicos da Água , Água Subterrânea/química , Estados Unidos , Poluentes Químicos da Água/análise , Fluorocarbonos/análise , Humanos
4.
J Environ Manage ; 370: 122615, 2024 Sep 24.
Artigo em Inglês | MEDLINE | ID: mdl-39321676

RESUMO

This paper leverages a data-driven two-step approach to effectively evaluate the effects of COVID-19 lockdown on air pollution in both the short and long-term in China. Using air pollution, meteorological conditions, and air mass clusters from 34 air quality monitoring stations in Beijing from 2015 to 2022, this study first employs a deweathering machine learning technique to decouple the confounding effects of meteorological on the air pollution. Furthermore, a detrending percentage change indictor is applied to remove the influence of seasonal variations on air pollution. The findings reveal that: (1) Human interventions are the primary drivers of changes in air pollution concentrations, whereas meteorological factors have a relatively minor impact. (2) During the COVID-19 lockdown, significant variations in air pollution levels are observed, with the effects of city lockdown ranging from a decrease of 40.11% ± 14.81% to an increase of 20.28% ± 14.36%. Notably, there is a decline in concentrations of NO2, PM2.5, CO, and PM10, while the levels of O3 and SO2 increase even during the strictest lockdown period. (3) In the year following the COVID-19 lockdown, there is a rebound in overall air pollution levels. However, by the second year, a general decline in air pollution is observed, except for O3. Therefore, it is imperative to integrate the confounding effects of meteorological factors into air quality management policies under various future scenarios: adopt high-intensity control measures for sudden air quality deteriorations, advance green recovery initiatives for long-term emission reductions, and coordinate efforts to reduce composite atmospheric pollution.

5.
Sci Total Environ ; 954: 176467, 2024 Sep 24.
Artigo em Inglês | MEDLINE | ID: mdl-39326764

RESUMO

Riparian zones play a crucial role in reducing nitrate pollution in both terrestrial and aquatic environments. Complex deposition action and dynamic hydrological processes will change the grain size distribution of riparian sediments, affect the residence time of substances, and have a cascade effect on the biogeochemical process of nitrate nitrogen (NO3--N). However, simultaneous studies on NO3--N transformation and the potential drivers in riparian zones are still lacking, especially neglecting the effect of sediment grain size (SGS). To fill this knowledge gap, we first systematically identified and quantified NO3--N biogeochemical processes in the riparian zone by integrating molecular biotechnology, 15N stable isotope tracing, and microcosmic incubation experiments. We then evaluated the combined effects of environmental variables (including pH, dissolved organic carbon (DOC), oxidation reduction potential, SGS, etc.) on NO3--N transformation through Random Forest and Structural Equation Models. The results demonstrated that NO3--N underwent five microbial-mediated processes, with denitrification, dissimilatory nitrate reduction to ammonium (DNRA) dominated the NO3--N attenuation (69.4 % and 20.1 %, respectively), followed by anaerobic ammonia oxidation (anammox) and nitrate-dependent ferric oxidation (NDFO) (8.4 % and 2.1 %, respectively), while nitrification dominated the NO3--N production. SGS emerged as the most critical factor influencing NO3--N transformation (24.96 %, p < 0.01), followed by functional genes (nirS, nrfA) abundance, DOC, and ammonia concentrations (14.12 %, 16.40 %, 13.08 %, p < 0.01). SGS influenced NO3--N transformation by regulating microbial abundance and nutrient concentrations. RF predicted that a 5 % increase in the proportion of fine grains (diameter < 50 µm) may increase the NO3--N transformation rate by 3.8 %. This work highlights the significance of integrating machine learning and geochemical analysis for a comprehensive understanding of nitrate biogeochemical processes in riparian zones, contributing valuable references for future nitrogen management strategies.

6.
Environ Geochem Health ; 46(10): 418, 2024 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-39249634

RESUMO

Fluoride (F) is a trace element that is essential to the human body and occurs naturally in the environment. However, a deficiency or excess of F in the environment can potentially lead to human health issues. The pseudototal amount of F in soil often does not correlate directly with the F content in plants. Instead, the F content within plants tends to have a greater correlation with the bioavailable F in soils. In large-scale soil surveys, only the pseudototal elemental content of soils is typically measured, which may not be highly reliable for developing agricultural zoning plans. There are significant variations in the ability of different plants to accumulate F from soil. Additionally, due to variations in soil elemental absorption mechanisms among different plant species, when multiple crops are grown in an area, it is typically necessary to study the elemental absorption mechanisms of each crop. To address these issues, in this study, we examined the factors influencing F bioaccumulation coefficients in different crops based on 1:50,000 soil geochemical survey data. Using the random forest algorithm, four indicators-bioavailable P, bioavailable Zn, leachable Pb, and Sr-were selected from among 29 parameters to predict the F content within crops to replace bioavailable F in the soil. Compared with the multivariate linear regression (MLR) model, the random forest (RF) model provided more accurate and reliable predictions of the fluoride content in crops, with the RF model's prediction accuracy improving by approximately 95.23%. Additionally, while the partial least squares regression (PLSR) model also offered improved accuracy over MLR, the RF model still outperformed PLSR in terms of prediction accuracy and robustness. Additionally, it maximized the utilization of existing geochemical survey data, enabling cross-species studies for the first time and avoiding redundant evaluations of different types of agricultural products in the same region. In this investigation, we selected the Xining-Ledu region of Qinghai Province, China, as the study area and employed a random forest model to predict the crop F content in soils, providing a new methodological framework for crop production that effectively enhances agricultural quality and efficiency.


Assuntos
Algoritmos , Produtos Agrícolas , Fluoretos , Poluentes do Solo , Produtos Agrícolas/química , Produtos Agrícolas/metabolismo , Fluoretos/análise , Poluentes do Solo/análise , Solo/química , Monitoramento Ambiental/métodos , Modelos Lineares , Algoritmo Florestas Aleatórias
7.
Int J Cardiol Cardiovasc Risk Prev ; 22: 200319, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39253594

RESUMO

Objective: The purpose of the research is to explore post-thrombotic syndrome (PTS) after catheter-directed thrombolysis (CDT) treatment for acute lower extremity deep vein thrombosis (DVT) risk factors. Methods: We retrospectively selected 171 patients with acute lower extremity DVT undergoing CDT treatment, collected clinical data of the patients, grouped them according to the follow-up results of 1 year after treatment, and included patients with PTS into the concurrent group and patients who did not develop PTS assigned to the unconcurrent group. Univariate analysis and Logistic regression were applied to analyze the risk factors of PTS after catheterization and thrombolytic therapy for acute lower extremity DVT. We applied R4.2.3 software to build three hybrid machine-learning models, including a nomogram, decision tree, and random forest with independent influencing factors as predictive variables. Results: The incidence of PTS after CDT in acute lower extremity DVT was 36.84 %. BMI >24.33 kg/m2, disease time >7 d, mixed DVT, varicose vein history, stress treatment time>6.5 months, and filter category were independent risk factors for PTS after CDT treatment for acute lower extremity DVT. The AUC value predicted by the random forest model was higher than that of the nomogram model (Z = -2.337, P = 0.019) and the decision tree model (Z = -2.995, P = 0.003). Conclusion: The occurrence of PTS after CDT treatment of acute lower extremity DVT is closely related to many factors, and the established random forest model had the best effect in predicting PTS complicated with PTS.

8.
Water Res ; 267: 122507, 2024 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-39342713

RESUMO

Nitrate dynamics within a catchment are critical to the earth's system process, yet the intricate details of its transport and transformation at high resolutions remain elusive. Hydrological effects on nitrate dynamics in particular have not been thoroughly assessed previously and this knowledge gap hampers our understanding and effective management of nitrogen cycling in watersheds. Here, machine learning (ML) models were employed to reconstruct the annual variation trend in nitrate dynamics and isotopes within a typical karst catchment. Random forest model demonstrates promising potential in predicting nitrate concentration and its isotopes, surpassing other ML models (including Long Short-term Memory, Convolutional Neural Network, and Support Vector Machine) in performance. The ML-modeled NO3--N concentrations, δ15N-NO3-, and δ18O-NO3- values were in close agreement with field data (NSE values of 0.95, 0.80, and 0.53, respectively), which are notably challenging to achieve for process models. During the transition from dry to wet period, approximately 23.0 % of the annual precipitation (∼269.1 mm) was identified as the threshold for triggering a rapid response in the wet period. The modeled nitrate isotope values were significantly supported by the field data, suggesting seasonal variations of nitrogen sources, with precipitation as the primary driving force for fertilizer sources. Mixing of multiple sources appeared to be the main control of the transport and transformation of nitrate during the rising limb in the wet period, whereas process control (denitrification) took precedence during the falling limb, and the fate of nitrate was controlled by biogeochemical processes during the dry period.

9.
Sci Total Environ ; 954: 176567, 2024 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-39349190

RESUMO

The study addresses the challenge of integrating complex landscape-hydrological interactions into predictive models for improved water resource management. The aim is to investigate the effectiveness of landscape metrics-quantitative indices measuring landscape composition and configuration-as predictors of WES in the Arno River Basin, Italy. Utilizing two hydrological models alongside a random forest algorithm, we assessed spatial and temporal variations in water yield, runoff, and groundwater recharge. The findings indicate that landscape metrics derived from high-resolution land use data significantly impact WES outcomes. Specifically, the models demonstrated average landscape metric importances of 16.8 % for spatial and 17.8 % for temporal predictions concerning runoff. For water yield, these averages were 32.9 % spatially and 43.5 % temporally, while groundwater modeling showed importances of 14.09 % spatially and 33.8 % temporally. Key landscape metrics identified include the core area index for broad-leaved forests and the perimeter-to-area ratio for non-irrigated agricultural areas as critical spatial and temporal predictors of water yield and groundwater recharge. Thresholds were observed, indicating landscape configurations that minimize hydrological variability. For instance, runoff variation is minimal when the landscape exhibits high forest fragmentation (over 1000 coniferous patches), low aggregation (aggregation index <75), and reduced connectivity (cohesion index under 80). Similarly, groundwater variation is minimized with decreased boundary length of vegetation patches (perimeter-to-area ratio <0.8), agricultural lands (perimeter-to-area ratio under 1), and the presence of low core agricultural areas (core area index above 8). The identified thresholds could inform land-use policies, such as targeted afforestation or crop diversification strategies, to optimize WES provision.

10.
Environ Sci Technol ; 58(40): 17532-17542, 2024 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-39315849

RESUMO

Epidemiological studies on associations of organophosphate ester (OPE) exposure and gestational diabetes mellitus (GDM) risk, which remain rare and inconclusive, were carried out with a case-control population comprising 287 GDM and 313 non-GDM pregnant women recruited from Tianjin. The GDM group suffered distinctly higher serum concentrations of tri-n-butyl phosphate (TNBP), tri(2-butoxyethyl) phosphate (TBOEP), triphenyl phosphate (TPHP), tri-iso-propyl phosphate (TIPP), and tri(1-chloro-2-propyl) phosphate (TCIPP) than the healthy control group (p < 0.001). Traditional analysis methods employed for either individual or mixture effects found positive correlations (p < 0.05) between the concentrations of five OPEs (i.e., TNBP, TBOEP, TPHP, TIPP, and TCIPP) and the incidence of GDM, while 2-ethylhexyl diphenyl phosphate, tri(1-chloro-2-propyl) phosphate, and bis(2-ethylhexyl) phosphate exhibited opposite effects. Three machine learning methods considering the concurrence of OPE mixture exposure and population characteristics were applied to clarify their relative importance to GDM risk, among which random forest performed the best. Several OPEs, particularly TNBP and TBOEP ranking at the top, made greater contributions than some demographical characteristics, such as prepregnancy body mass index and family history of diabetes, to the occurrence of GDM. This was further validated by another independent case-control population obtained from Hangzhou.


Assuntos
Diabetes Gestacional , Organofosfatos , Humanos , Diabetes Gestacional/epidemiologia , Feminino , Gravidez , Estudos de Casos e Controles , Adulto , Ésteres
11.
Med Phys ; 51(11): 8434-8441, 2024 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-39269979

RESUMO

BACKGROUND: Aortic dissection (AD) is a life-threatening cardiovascular emergency that is often misdiagnosed as other chest pain conditions. Physiologically, AD may cause abnormalities in peripheral blood flow, which can be detected using pulse oximetry waveforms. PURPOSE: This study aimed to assess the feasibility of identifying AD based on pulse oximetry waveforms and to highlight the key waveform features that play a crucial role in this diagnostic method. METHODS: This prospective study employed high-risk chest pain cohorts from two emergency departments. The initial cohort was enriched with AD patients (n = 258, 47% AD) for model development, while the second cohort consisted of chest pain patients awaiting angiography (n = 71, 25% AD) and was used for external validation. Pulse oximetry waveforms from the four extremities were collected for each patient. After data preprocessing, a recognition model based on the random forest algorithm was trained using patients' gender, age, and waveform difference features extracted from the pulse oximetry waveforms. The performance of the model was evaluated using receiver operating characteristic (ROC) curve analysis and decision curve analysis (DCA). The importance of features was also assessed using Shapley Value and Gini importance. RESULTS: The model demonstrated strong performance in identifying AD in both the training and external validation sets. In the training set, the model achieved an area under the ROC curve of 0.979 (95% CI: 0.961-0.990), sensitivity of 0.918 (95% CI: 0.873-0.955), specificity of 0.949 (95% CI: 0.912-0.985), and accuracy of 0.933 (95% CI: 0.904-0.959). In the external validation set, the model attained an area under the ROC curve of 0.855 (95% CI: 0.720-0.965), sensitivity of 0.889 (95% CI: 0.722-1.000), specificity of 0.698 (95% CI: 0.566-0.812), and accuracy of 0.794 (95% CI: 0.672-0.878). Decision curve analysis (DCA) further showed that the model provided a substantial net benefit for identifying AD. The median mean and median variance of the four limbs' signals were the most influential features in the recognition model. CONCLUSIONS: This study demonstrated the feasibility and strong performance of identifying AD based on peripheral pulse oximetry waveforms in high-risk chest pain populations in the emergency setting. The findings also provided valuable insights for future human fluid dynamics simulations to elucidate the impact of AD on blood flow in greater detail.


Assuntos
Dissecção Aórtica , Oximetria , Humanos , Oximetria/métodos , Dissecção Aórtica/diagnóstico por imagem , Dissecção Aórtica/fisiopatologia , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Estudos de Viabilidade , Estudos Prospectivos
12.
J Water Health ; 22(9): 1606-1617, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39340374

RESUMO

Household water treatment (HWT) is recommended when safe drinking water is limited. To understand determinants of HWT adoption, we conducted a cross-sectional survey with 650 households across different regions in Haiti. Data were collected on 71 demographic and psychosocial factors and 2 outcomes (self-reported and confirmed HWT use). Data were transformed into 169 possible determinants of adoption across nine categories. We assessed determinants using logistic regression and, as machine learning methods are increasingly used, random forest analyses. Overall, 376 (58%) respondents self-reported treating or purchasing water, and 123 (19%) respondents had residual chlorine in stored household water. Both logistic regression and machine learning analyses had high accuracy (area under the receiver operating characteristic curve (AUC): 0.77-0.82), and the strongest determinants in models were in the demographics and socioeconomics, risk belief, and WASH practice categories. Determinants that can be influenced inform HWT promotion in Haiti. It is recommended to increase access to HWT products, provide cash and education on water treatment to emergency-impacted populations, and focus future surveys on known determinants of adoption. We found both regression and machine learning methods need informed, thoughtful, and trained analysts to ensure meaningful results and discuss the benefits/drawbacks of analysis methods herein.


Assuntos
Características da Família , Aprendizado de Máquina , Purificação da Água , Haiti , Purificação da Água/métodos , Humanos , Modelos Logísticos , Estudos Transversais , Água Potável , Feminino , Masculino , Adulto , Abastecimento de Água , Fatores Socioeconômicos
13.
Sci Total Environ ; 951: 175768, 2024 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-39191325

RESUMO

The river course is a transitional area connecting the source and receiving water bodies. The dissolved organic matter (DOM) in the river course is an important factor affecting the aquatic environment and ecological health. However, there are shortcomings in studying the differences and quantitative contributions of river DOM in different seasons and sources. In this study, ultraviolet-visible (UV-vis) and three-dimensional fluorescence spectra were used to characterize the optical properties, analyze the spatiotemporal changes, and establish the quantitative relationship between environmental factors and DOM in the inflow rivers of Baiyangdian Lake. The results showed that the relative DOM concentrations in summer and autumn were significantly higher than those in the other seasons (P < 0.001) and that the DOM source (SR < 1) was mainly exogenous. The fluorescence abundance of protein-like substances (C1 + C2 + C3) was the highest in spring, whereas that of humus C4 was the highest in autumn. Moreover, the inflow rivers exhibited strong autogenetic characteristics (BIX > 1) throughout the year. Self-organizing maps (SOM) indicated that the main driving factors of water quality were NO3--N in spring, autumn, and winter and DO, pH, and chemical oxygen demand (COD) in summer. Random forest analysis showed that the fluorescent components (C1-C4) were closely related to the migration and transformation of nitrogen, and pH and nitrogen were the main predictors of each component. The Mantel test and structural equation model (SEM) showed that temperature and NO3--N significantly influenced the DOM concentration, components, and molecular properties in different seasons. Moreover, the river source also affected the distribution mechanism of DOM in the water body. Our study comprehensively analyzed the response of DOM in inflow rivers in different seasons and water sources, providing a basis for further understanding the driving mechanisms of water quality.

14.
Sci Rep ; 14(1): 18834, 2024 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-39138311

RESUMO

As we all know, momentum plays a crucial role in ball game. Based on the 2023 Wimbledon final data, this paper investigated momentum in tennis. Firstly, we initially trained a decision tree regression model on reprocessed data for prediction, and established the CBRF model based on CatBoost regression and random forest regression models to obtain prediction data. Secondly, significant non-zero autocorrelation coefficients were found, confirming the correlation between momentum and success. Thirdly, Based on these key factors, we proposed winning strategies for the players, conducted predictive analyses for six specific time intervals of the game. At last, by implementing these models to women's matches, championships, matches on different surfaces, the results demonstrated that the models have effective generalization ability.

15.
Sci Total Environ ; 950: 175281, 2024 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-39117235

RESUMO

Machine learning models (MLMs) have been increasingly used to forecast water pollution. However, the "black box" characteristic for understanding mechanism processes still limits the applicability of MLMs for water quality management in hydro-projects under complex and frequently artificial regulation. This study proposes an interpretable machine learning framework for water quality prediction coupled with a hydrodynamic (flow discharge) scenario-based Random Forest (RF) model with multiple model-agnostic techniques and quantifies global, local, and joint interpretations (i.e., partial dependence, individual conditional expectation, and accumulated local effects) of environmental factor implications. The framework was applied and verified to predict the permanganate index (CODMn) under different flow discharge regulation scenarios in the Middle Route of the South-to-North Water Diversion Project of China (MRSNWDPC). A total of 4664 sampling cases data matrices, including water quality, meteorological, and hydrological indicators from eight national stations along the main canal of the MRSNWDPC, were collected from May 2019 to December 2020. The results showed that the RF models were effective in forecasting CODMn in all flow discharge scenarios, with a mean square error, coefficient of determination, and mean absolute error of 0.006-0.026, 0.481-0.792, and 0.069-0.104, respectively, in the testing dataset. A global interpretation indicated that dissolved oxygen, flow discharge, and surface pressure are the three most important variables of CODMn. Local and joint interpretations indicated that the RF-based prediction model provides a basic understanding of the physical mechanisms of environmental systems. The proposed framework can effectively learn the fundamental environmental implications of water quality variations and provide reliable prediction performance, highlighting the importance of model interpretability for trustworthy machine learning applications in water management projects. This study provides scientific references for applying advanced data-driven MLMs to water quality forecasting and a reliable methodological framework for water quality management and similar hydro-projects.

16.
BMC Public Health ; 24(1): 2101, 2024 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-39097727

RESUMO

With childhood hypertension emerging as a global public health concern, understanding its associated factors is crucial. This study investigated the prevalence and associated factors of hypertension among Chinese children. This cross-sectional investigation was conducted in Pinghu, Zhejiang province, involving 2,373 children aged 8-14 years from 12 schools. Anthropometric measurements were taken by trained staff. Blood pressure (BP) was measured in three separate occasions, with an interval of at least two weeks. Childhood hypertension was defined as systolic blood pressure (SBP) and/or diastolic blood pressure (DBP) ≥ age-, sex-, and height-specific 95th percentile, across all three visits. A self-administered questionnaire was utilized to collect demographic, socioeconomic, health behavioral, and parental information at the first visit of BP measurement. Random forest (RF) and multivariable logistic regression model were used collectively to identify associated factors. Additionally, population attributable fractions (PAFs) were calculated. The prevalence of childhood hypertension was 5.0% (95% confidence interval [CI]: 4.1-5.9%). Children with body mass index (BMI) ≥ 85th percentile were grouped into abnormal weight, and those with waist circumference (WC) > 90th percentile were sorted into central obesity. Normal weight with central obesity (NWCO, adjusted odds ratio [aOR] = 5.04, 95% CI: 1.96-12.98), abnormal weight with no central obesity (AWNCO, aOR = 4.60, 95% CI: 2.57-8.21), and abnormal weight with central obesity (AWCO, aOR = 9.94, 95% CI: 6.06-16.32) were associated with an increased risk of childhood hypertension. Childhood hypertension was attributable to AWCO mostly (PAF: 0.64, 95% CI: 0.50-0.75), followed by AWNCO (PAF: 0.34, 95% CI: 0.19-0.51), and NWCO (PAF: 0.13, 95% CI: 0.03-0.30). Our results indicated that obesity phenotype is associated with childhood hypertension, and the role of weight management could serve as potential target for intervention.


Assuntos
Hipertensão , Humanos , Estudos Transversais , Masculino , Feminino , Hipertensão/epidemiologia , China/epidemiologia , Criança , Prevalência , Adolescente , Fatores de Risco , Modelos Logísticos , Algoritmo Florestas Aleatórias
17.
Acad Radiol ; 2024 Jul 12.
Artigo em Inglês | MEDLINE | ID: mdl-39003227

RESUMO

RATIONALE AND OBJECTIVES: Prior to clinical presentations of Alzheimer's Disease (AD), neuropathological changes, such as amyloid-ß and brain atrophy, have accumulated at the earlier stages of the disease. The combination of such biomarkers assessed by multiple modalities commonly improves the likelihood of AD etiology. We aimed to explore the discriminative ability of Aß PET features and whether combining Aß PET and structural MRI features can improve the classification performance of the machine learning model in older healthy control (OHC) and mild cognitive impairment (MCI) from AD. MATERIAL AND METHODS: We collected 94 AD patients, 82 MCI patients, and 85 OHC from three different cohorts. 17 global/regional Aß features in Centiloid, 122 regional volume, and 68 regional cortical thickness were extracted as imaging features. Single or combined modality features were used to train the random forest model on the testing set. The top 10 features were sorted based on the Gini index in each binary classification. RESULTS: The results showed that AUC scores were 0.81/0.86 and 0.69/0.68 using sMRI/Aß PET features on the testing set in differentiating OHC and MCI from AD. The performance was improved while combining two-modality features with an AUC of 0.89 and an AUC of 0.71 in two classifications. Compared to sMRI features, particular Aß PET features contributed more to differentiating AD from others. CONCLUSION: Our study demonstrated the discriminative ability of Aß PET features in differentiating AD from OHC and MCI. A combination of Aß PET and structural MRI features can improve the RF model performance.

18.
Sci Total Environ ; 946: 174528, 2024 Oct 10.
Artigo em Inglês | MEDLINE | ID: mdl-38971243

RESUMO

Soil aggregates are crucial for soil organic carbon (OC) accumulation. This study, utilizing a 32-year fertilization experiment, investigates whether the core microbiome can elucidate variations in carbon content and decomposition across different aggregate sizes more effectively than broader bacterial and fungal community analyses. Employing ensemble learning algorithms that integrate machine learning with network inference, we found that the core microbiome accounts for an average increase of 26 % and 20 % in the explained variance of PCoA and Adonis analyses, respectively, in response to fertilization. Compared to the control, inorganic and organic fertilizers decreased the decomposition index (DDI) by 31 % and 38 %, respectively. The fungal core microbiome predominantly influenced OC content and DDI in larger macroaggregates (>2000 µm), explaining over 35 % of the variance, while the bacterial core microbiome had a lesser impact, explaining <30 %. Conversely, in smaller aggregates (<2000 µm), the bacterial core microbiome significantly influenced DDI (R2 > 0.2), and the fungal core microbiome more strongly affected OC content (R2 > 0.3). Mantel tests showed that pH is the most significant environmental factor affecting core microbiome composition across all aggregate sizes (Mantel's r > 0.8, P < 0.01). Linear correlation analysis further confirmed that the core microbiome's community structure could accurately predict OC content and DDI in aggregates (R2 > 0.8, P < 0.05). Overall, our findings suggested that the core microbiome provides deeper insights into the variability of aggregate organic carbon content and decomposition, with the bacterial core microbiome playing a particularly pivotal role within the soil aggregates.


Assuntos
Carbono , Aprendizado de Máquina , Microbiota , Microbiologia do Solo , Solo , Carbono/metabolismo , Carbono/análise , Solo/química , Algoritmos , Fungos/metabolismo , Bactérias/metabolismo , Fertilizantes
19.
Behav Sci (Basel) ; 14(7)2024 Jul 04.
Artigo em Inglês | MEDLINE | ID: mdl-39062385

RESUMO

The frequent turnover of college graduates is a key factor leading to the frictional unemployment and structural unemployment of youth, which are important research fields concerned with pedagogy, sociology, and management; however, there is little research on the prediction of college graduates' turnover. Therefore, this study investigated the turnover status of 17,268 college graduates from 52 universities in China, constructed and optimized a random forest model for predicting the turnover of college graduates, and analyzed the influencing mechanism of college graduates' turnover and the importance of influencing factors. The enhanced random forest model could deal with the unbalanced data and has a higher prediction accuracy as well as stronger generalization ability in predicting the turnover of college graduates. Individual background variables, job characteristic variables, and work environment variables are all important factors influencing whether college graduates resign or not. The top five factors that affect the turnover of college graduates by more than 10% are income level, job satisfaction degree, job opportunities, and job matching degree. The conclusion of this study is conducive to improving the accuracy of turnover prediction, systematically exploring the influencing factors of college graduates' turnover, and effectively guaranteeing the overall stability of youth employment.

20.
Comput Biol Med ; 179: 108880, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39018880

RESUMO

BACKGROUND: The effective management of trauma patients necessitates efficient triaging, timely activation of Massive Blood Transfusion Protocols (MTP), and accurate prediction of in-hospital outcomes. Machine learning (ML) algorithms have emerged as up-and-coming tools in the domains of optimizing triage decisions, improving intervention strategies, and predicting clinical outcomes, consistently outperforming traditional methodologies. This study aimed to develop, assess, and compare several ML models for the triaging processes, activation of MTP, and mortality prediction. METHODS: In a 10-year retrospective study, the predictive capabilities of seven ML models for trauma patients were systematically assessed using on-admission patients' hemodynamic data. All patient's data were randomly divided into training (80 %) and test (20 %) sets. Employing Python for data preprocessing, feature scaling, and model development, we evaluated K-Nearest Neighbors (KNN), Logistic Regression (LR), Decision Tree (DT), Support Vector Machines (SVM) with RBF kernels, Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Artificial Neural Network (ANN). We employed various imputation techniques and addressed data imbalance through down-sampling, up-sampling, and synthetic minority for the over-sampling technique (SMOTE). Hyperparameter tuning, coupled with 5-fold cross-validation, was performed. The evaluation included essential metrics like sensitivity, specificity, F1 score, accuracy, Area Under the Receiver Operating Curve (AUC ROC), and Area Under the Precision recall Curve (AUC PR), ensuring robust predictive capability. RESULT: This study included 17,390 adult trauma patients; of them, 19.5 % (3385) were triaged at a critical level, 3.8 % (664) required MTP, and 7.7 % (1335) died in the hospital. The model's performance improved using imputation and balancing techniques. The overall models demonstrated notable performance metrics for predicting triage, MTP activation, and mortality with F1 scores of 0.75, 0.42, and 0.79, sensitivities of 0.73, 0.82, and 0.9, and AUC ROC values of 0.89, 0.95 and 0.99 respectively. CONCLUSION: Machine learning, especially RF models, effectively predicted trauma triage, MTP activation, and mortality. Featured critical hemodynamic variables include shock indices, systolic blood pressure, and mean arterial pressure. Therefore, models can do better than individual parameters for the early management and disposition of patients in the ED. Future research should focus on creating sensitive and interpretable models to enhance trauma care.


Assuntos
Transfusão de Sangue , Hemodinâmica , Aprendizado de Máquina , Triagem , Ferimentos e Lesões , Humanos , Triagem/métodos , Ferimentos e Lesões/mortalidade , Ferimentos e Lesões/fisiopatologia , Hemodinâmica/fisiologia , Masculino , Feminino , Estudos Retrospectivos , Adulto , Pessoa de Meia-Idade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA