Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
Ren Fail ; 46(2): 2380752, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-39039848

RESUMO

CONTEXT: Four algorithms with relatively balanced complexity and accuracy in deep learning classification algorithm were selected for differential diagnosis of primary membranous nephropathy (PMN). OBJECTIVE: This study explored the most suitable classification algorithm for PMN identification, and to provide data reference for PMN diagnosis research. METHODS: A total of 500 patients were referred to Luo-he Central Hospital from 2019 to 2021. All patients were diagnosed with primary glomerular disease confirmed by renal biopsy, contained 322 cases of PMN, the 178 cases of non-PMN. Using the decision tree, random forest, support vector machine, and extreme gradient boosting (Xgboost) to establish a differential diagnosis model for PMN and non-PMN. Based on the true positive rate, true negative rate, false-positive rate, false-negative rate, accuracy, feature work area under the curve (AUC) of subjects, the best performance of the model was chosen. RESULTS: The efficiency of the Xgboost model based on the above evaluation indicators was the highest, which the diagnosis of PMN of the sensitivity and specificity, respectively 92% and 96%. CONCLUSIONS: The differential diagnosis model for PMN was established successfully and the efficiency performance of the Xgboost model was the best. It could be used for the clinical diagnosis of PMN.


Membranous nephropathy (MN) without obvious causes is called primary MN (PMN), This study utilized deep learning classification algorithms for differential diagnosis of PMN and explored the most suitable classification algorithm for PMN recognition, provided data reference for PMN diagnosis research.


Assuntos
Glomerulonefrite Membranosa , Humanos , Glomerulonefrite Membranosa/diagnóstico , Glomerulonefrite Membranosa/patologia , Diagnóstico Diferencial , Masculino , Feminino , Pessoa de Meia-Idade , Adulto , Aprendizado de Máquina , Algoritmos , Sensibilidade e Especificidade , Máquina de Vetores de Suporte , Estudos Retrospectivos , Árvores de Decisões , Aprendizado Profundo , Biópsia
2.
Sensors (Basel) ; 24(3)2024 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-38339556

RESUMO

Truck hoisting detection constitutes a key focus in port security, for which no optimal resolution has been identified. To address the issues of high costs, susceptibility to weather conditions, and low accuracy in conventional methods for truck hoisting detection, a non-intrusive detection approach is proposed in this paper. The proposed approach utilizes a mathematical model and an extreme gradient boosting (XGBoost) model. Electrical signals, including voltage and current, collected by Hall sensors are processed by the mathematical model, which augments their physical information. Subsequently, the dataset filtered by the mathematical model is used to train the XGBoost model, enabling the XGBoost model to effectively identify abnormal hoists. Improvements were observed in the performance of the XGBoost model as utilized in this paper. Finally, experiments were conducted at several stations. The overall false positive rate did not exceed 0.7% and no false negatives occurred in the experiments. The experimental results demonstrated the excellent performance of the proposed approach, which can reduce the costs and improve the accuracy of detection in container hoisting.

3.
Clin Endocrinol (Oxf) ; 98(1): 98-109, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-35171531

RESUMO

OBJECTIVE: Distant metastasis often indicates a poor prognosis, so early screening and diagnosis play a significant role. Our study aims to construct and verify a predictive model based on machine learning (ML) algorithms that can estimate the risk of distant metastasis of newly diagnosed follicular thyroid carcinoma (FTC). DESIGN: This was a retrospective study based on the Surveillance, Epidemiology, and End Results (SEER) database from 2004 to 2015. PATIENTS: A total of 5809 FTC patients were included in the data analysis. Among them, there were 214 (3.68%) cases with distant metastasis. METHOD: Univariate and multivariate logistic regression (LR) analyses were used to determine independent risk factors. Seven commonly used ML algorithms were applied for predictive model construction. We used the area under the receiver-operating characteristic (AUROC) curve to select the best ML algorithm. The optimal model was trained through 10-fold cross-validation and visualized by SHapley Additive exPlanations (SHAP). Finally, we compared it with the traditional LR method. RESULTS: In terms of predicting distant metastasis, the AUROCs of the seven ML algorithms were 0.746-0.836 in the test set. Among them, the Extreme Gradient Boosting (XGBoost) had the best prediction performance, with an AUROC of 0.836 (95% confidence interval [CI]: 0.775-0.897). After 10-fold cross-validation, its predictive power could reach the best [AUROC: 0.855 (95% CI: 0.803-0.906)], which was slightly higher than the classic binary LR model [AUROC: 0.845 (95% CI: 0.818-0.873)]. CONCLUSIONS: The XGBoost approach was comparable to the conventional LR method for predicting the risk of distant metastasis for FTC.


Assuntos
Adenocarcinoma Folicular , Neoplasias da Glândula Tireoide , Humanos , Estudos Retrospectivos , Aprendizado de Máquina , Algoritmos , Neoplasias da Glândula Tireoide/diagnóstico
4.
Environ Monit Assess ; 195(11): 1355, 2023 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-37864622

RESUMO

This paper presents a new remote sensing (RS) algorithm for retrieving instantaneous sea surface solar irradiance (SR) by using the XGBoost (XGB) package in RStudio and available remote sensing data along with ground-truth solar irradiance data. By means of XGB, the new RS algorithm, called LSU model, was structurally built with nine key RS parameters, including photosynthetically available radiation (PAR); instantaneous PAR (iPAR); water leaving reflectance Rrs at wavelengths 412, 443, 469, and 488 nm; angstrom; aerosol optical thickness (aot_869); and latitude that represent major sources and sinks of solar irradiance, as model input variables. Among the nine parameters, the most important four parameters are PAR, iPAR, latitude, and aot_869. It was found that the sea surface SR is highly affected by conditions in both the atmosphere and the seawater. The aot_869 is by far the most important factor describing the effects of the atmospheric absorption and scattering of SR before reaching the sea surface. The most important factors describing the effects of seawater characteristics on solar irradiance are PAR, iPAR, and latitude. Comparisons with existing SR models indicate that LSU model is scientifically sound due to the use of major source and sink factors of SR as model input variables. LSU model is also technically accurate due to its fine resolution (1×1 km) and overall best performance in predicting instantaneous SR. More importantly, LSU model is globally applicable as it can be utilized to obtain global-scale SR data for any day, any time, and anywhere in the world.


Assuntos
Monitoramento Ambiental , Tecnologia de Sensoriamento Remoto , Luz Solar , Algoritmos , Água do Mar
5.
BMC Infect Dis ; 21(1): 839, 2021 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-34412581

RESUMO

BACKGROUND: Hemorrhagic fever with renal syndrome (HFRS) is still attracting public attention because of its outbreak in various cities in China. Predicting future outbreaks or epidemics disease based on past incidence data can help health departments take targeted measures to prevent diseases in advance. In this study, we propose a multistep prediction strategy based on extreme gradient boosting (XGBoost) for HFRS as an extension of the one-step prediction model. Moreover, the fitting and prediction accuracy of the XGBoost model will be compared with the autoregressive integrated moving average (ARIMA) model by different evaluation indicators. METHODS: We collected HFRS incidence data from 2004 to 2018 of mainland China. The data from 2004 to 2017 were divided into training sets to establish the seasonal ARIMA model and XGBoost model, while the 2018 data were used to test the prediction performance. In the multistep XGBoost forecasting model, one-hot encoding was used to handle seasonal features. Furthermore, a series of evaluation indices were performed to evaluate the accuracy of the multistep forecast XGBoost model. RESULTS: There were 200,237 HFRS cases in China from 2004 to 2018. A long-term downward trend and bimodal seasonality were identified in the original time series. According to the minimum corrected akaike information criterion (CAIC) value, the optimal ARIMA (3, 1, 0) × (1, 1, 0)12 model is selected. The index ME, RMSE, MAE, MPE, MAPE, and MASE indices of the XGBoost model were higher than those of the ARIMA model in the fitting part, whereas the RMSE of the XGBoost model was lower. The prediction performance evaluation indicators (MAE, MPE, MAPE, RMSE and MASE) of the one-step prediction and multistep prediction XGBoost model were all notably lower than those of the ARIMA model. CONCLUSIONS: The multistep XGBoost prediction model showed a much better prediction accuracy and model stability than the multistep ARIMA prediction model. The XGBoost model performed better in predicting complicated and nonlinear data like HFRS. Additionally, Multistep prediction models are more practical than one-step prediction models in forecasting infectious diseases.


Assuntos
Febre Hemorrágica com Síndrome Renal , China/epidemiologia , Previsões , Febre Hemorrágica com Síndrome Renal/epidemiologia , Humanos , Incidência , Modelos Estatísticos , Estações do Ano
6.
BMC Pulm Med ; 21(1): 320, 2021 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-34654406

RESUMO

BACKGROUND: To investigate the risk factors and construct a logistic model and an extreme gradient boosting (XGBoost) model to compare the predictive performances for readmission in acute exacerbation of chronic obstructive pulmonary disease (AECOPD) patients within one year. METHODS: In total, 636 patients with AECOPD were recruited and divided into readmission group (n = 449) and non-readmission group (n = 187). Backward stepwise regression method was used to analyze the risk factors for readmission. Data were divided into training set and testing set at a ratio of 7:3. Variables with statistical significance were included in the logistic model and variables with P < 0.1 were included in the XGBoost model, and receiver operator characteristic (ROC) curves were plotted. RESULTS: Patients with acute exacerbations within the previous 1 year [odds ratio (OR) = 4.086, 95% confidence interval (CI) 2.723-6.133, P < 0.001), long-acting ß agonist (LABA) application (OR = 4.550, 95% CI 1.587-13.042, P = 0.005), inhaled corticosteroids (ICS) application (OR = 0.227, 95% CI 0.076-0.672, P = 0.007), glutamic-pyruvic transaminase (ALT) level (OR = 0.985, 95% CI 0.971-0.999, P = 0.042), and total CAT score (OR = 1.091, 95% CI 1.048-1.136, P < 0.001) were associated with the risk of readmission. The AUC value of the logistic model was 0.743 (95% CI 0.692-0.795) in the training set and 0.699 (95% CI 0.617-0.780) in the testing set. The AUC value of XGBoost model was 0.814 (95% CI 0.812-0.815) in the training set and 0.722 (95% CI 0.720-0.725) in the testing set. CONCLUSIONS: The XGBoost model showed a better predictive value in predicting the risk of readmission within one year in the AECOPD patients than the logistic regression model. The findings of our study might help identify patients with a high risk of readmission within one year and provide timely treatment to prevent the reoccurrence of AECOPD.


Assuntos
Readmissão do Paciente , Doença Pulmonar Obstrutiva Crônica , Medição de Risco/métodos , Doença Aguda , Corticosteroides/uso terapêutico , Idoso , Idoso de 80 Anos ou mais , China , Progressão da Doença , Feminino , Humanos , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Alta do Paciente , Readmissão do Paciente/estatística & dados numéricos , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/tratamento farmacológico , Doença Pulmonar Obstrutiva Crônica/fisiopatologia , Fatores de Risco , Tempo
7.
Ecotoxicol Environ Saf ; 204: 111059, 2020 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-32791357

RESUMO

Exploring the Manganese (Mn) removal prediction with several independent variables is tremendously critical and indispensable to understand the pattern of removal process. Mn is one of the key heavy metals (HMs) stipulated by the WHO for the development of many attributes of the ecosystem in controlled quantity. In the present paper, an extreme gradient model (XGBoost) is proposed for Mn prediction. A compressive statistical analysis reveals the stochastics behaviour of the data prior to the prediction investigation. The main goal is to determine the Mn predictability of XGBoost algorithm with influencing factors such as D2EHPA (M), Time (min), H2SO4 (M), NaCl (g/L), and EDTA (mM). The PCA biplot signifies the importance of the predictors. The XGBoost model validated against a diversity of data-driven models such as multilinear regression (MLR), support vector machine (SVM), and random forest (RF). The order of the applied models' performance are XGBoost > RF > SVM > MLR as per their R2 and RMSE metrics over testing phase i.e. 20.88, 0.75, 0.61, 0.40, and 2.23, 3.01, 3.51, 6.38, respectively. Moreover, the Taylor diagram and Radar chart have drown to emphasize the XGBoost model efficiency, stability, and reliability. In respect of XGBoost model prediction, 'Time' predictor outperforms D2EHPA, EDTA, H2SO4, and NaCl predictors in order.


Assuntos
Água Doce/química , Manganês/análise , Modelos Teóricos , Máquina de Vetores de Suporte , Poluentes Químicos da Água/análise , Algoritmos , Ecossistema , Previsões , Aprendizado de Máquina
8.
Ying Yong Sheng Tai Xue Bao ; 35(3): 789-796, 2024 Mar 18.
Artigo em Inglês | MEDLINE | ID: mdl-38646767

RESUMO

We established the optimal model by using the automatic machine learning method to predict the degradation efficiency of herbicide atrazine in soil, which could be used to assess the residual risk of atrazine in soil. We collected 494 pairs of data from 49 published articles, and selected seven factors as input features, including soil pH, organic matter content, saturated hydraulic conductivity, soil moisture, initial concentration of atrazine, incubation time, and inoculation dose. Using the first-order reaction rate constant of atrazine in soil as the output feature, we established six models to predict the degradation efficiency of atrazine in soil, and conducted comprehensive analysis of model performance through linear regression and related evaluation indicators. The results showed that the XGBoost model had the best performance in predicting the first-order reaction rate constant (k). Based on the prediction model, the feature importance ranking of each factor was in an order of soil moisture > incubation time > pH > organic matter > initial concentration of atrazine > saturated hydraulic conductivity > inoculation dose. We used SHAP to explain the potential relationship between each feature and the degradation ability of atrazine in soil, as well as the relative contribution of each feature. Results of SHAP showed that time had a negative contribution and saturated hydraulic conductivity had a positive contribution. High values of soil moisture, initial concentration of atrazine, pH, inoculation dose and organic matter content were generally distributed on both sides of SHAP=0, indicating their complex contributions to the degradation of atrazine in soil. The XGBoost model method combined with the SHAP method had high accuracy in predicting the performance and interpretability of the k model. By using machine learning method to fully explore the value of historical experimental data and predict the degradation efficiency of atrazine using environmental parameters, it is of great significance to set the threshold for atrazine application, reduce the residual and diffusion risks of atrazine in soil, and ensure the safety of soil environment.


Assuntos
Atrazina , Herbicidas , Modelos Teóricos , Poluentes do Solo , Solo , Atrazina/análise , Atrazina/química , Poluentes do Solo/análise , Poluentes do Solo/química , Herbicidas/análise , Herbicidas/química , Solo/química , Biodegradação Ambiental , Aprendizado de Máquina , Previsões
9.
Med Phys ; 51(6): 4536-4545, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38639653

RESUMO

BACKGROUND: Plane-parallel ionization chambers are the recommended secondary standard systems for clinical reference dosimetry of electrons. Dosimetry in high dose rate and dose-per-pulse (DPP) is challenging as ionization chambers are subject to ion recombination, especially when dose rate and/or DPP is increased beyond the range of conventional radiotherapy. The lack of universally accepted models for correction of ion recombination in UDHR is still an issue as it is, especially in FLASH-RT research, which is crucial in order to be able to accurately measure the dose for a wide range of dose rates and DPPs. PURPOSE: The objective of this study was to show the feasibility of developing an Artificial Intelligence model to predict the ion-recombination factor-ksat for a plane-parallel Advanced Markus ionization chamber for conventional and ultra-high dose rate electron beams based on machine parameters. In addition, the predicted ksat of the AI model was compared with the current applied analytical models for this correction factor. METHODS: A total number of 425 measurements was collected with a balanced variety in machine parameter settings. The specific ksat values were determined by dividing the output of the reference dosimeter (optically stimulated luminescence [OSL]) by the output of the AM chamber. Subsequently, a XGBoost regression model was trained, which used the different machine parameters as input features and the corresponding ksat value as output. The prediction accuracy of this regression model was characterized by R2-coefficient of determination, mean absolute error and root mean squared error. In addition, the model was compared with the Two-Voltage (TVA) method and empirical Petersson model for 19 different dose-per-pulse values ranging from conventional to UDHR regimes. The Akiake Information criterion (AIC) was calculated for the three different models. RESULTS: The XGBoost regression model reached a R2-score of 0.94 on the independent test set with a MAE of 0.067 and RMSE of 0.106. For the additional 19 random data points, the ksat values predicted by the XGBoost model showed to be in agreement, within the uncertainties, with the ones determined by the Petersson model and better than the TVA method for doses per pulse >3.5 Gy with a maximum deviation from the ground truth of 14.2%, 16.7%, and -36.0%, respectively, for DPP >4 Gy. CONCLUSION: The proposed method of using AI for ksat determination displays efficiency. For the investigated DPPs, the ksat values obtained with the XGBoost model were in concurrence with the ones obtained with the current available analytical models within the boundaries of uncertainty, certainly for the DPP characterizing UDHR. But the overall performance of the AI model, taking the number of free parameters into account, lacked efficiency. Future research should optimize the determination of the experimental ksat, and investigate the determination the ksat for DPPs higher than the ones investigated in this study, while also evaluating the prediction of the proposed XGBoost model for UDHR machines of different centers.


Assuntos
Elétrons , Radiometria , Dosagem Radioterapêutica , Elétrons/uso terapêutico , Radiometria/instrumentação , Radiometria/métodos , Automação , Doses de Radiação , Inteligência Artificial
10.
Front Mol Biosci ; 11: 1436135, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39193220

RESUMO

Introduction: Individuals with diabetes mellitus (DM) are at an increased risk of Mycobacterium tuberculosis (Mtb) infection and progressing from latent tuberculosis (TB) infection to active tuberculosis disease. TB in the DM population is more likely to go undiagnosed due to smear-negative results. Methods: Exhaled breath samples were collected and analyzed using high-pressure photon ionization time-of-flight mass spectrometry. An eXtreme Gradient Boosting (XGBoost) model was utilized for breathomics analysis and TB detection. Results: XGBoost model achieved a sensitivity of 88.5%, specificity of 100%, accuracy of 90.2%, and an area under the curve (AUC) of 98.8%. The most significant feature across the entire set was m106, which demonstrated a sensitivity of 93%, specificity of 100%, and an AUC of 99.7%. Discussion: The breathomics-based TB detection method utilizing m106 exhibited high sensitivity and specificity potentially beneficial for clinical TB screening and diagnosis in individuals with diabetes.

11.
Accid Anal Prev ; 203: 107601, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38718664

RESUMO

The driver's takeover time is crucial to ensure a safe takeover transition in conditional automated driving. The study aimed to construct a prediction model of driver's takeover time based on individual characteristics, external environment, and situation awareness variables. A total of 18 takeover events were designed with scenarios, non-driving-related tasks, takeover request time, and traffic flow as variables. High-fidelity driving simulation experiments were carried out, through which the driver's takeover data was obtained. Fifteen basic factors and three dynamic factors were extracted from individual characteristics, external environment, and situation awareness. In this experiment, these 18 factors were selected as input variables, and XGBoost and Shapely were used as prediction methods. A takeover time prediction model (BM + SA model) was then constructed. Moreover, we analyzed the main effect of input variables on takeover time, and the interactive contribution made by the variables. And in this experiment, the 15 basic factors were selected as input variables, and the basic takeover time prediction model (BM model) was constructed. In addition, this study compared the performance of the two models and analyzed the contribution of input variables to takeover time. The results showed that the goodness of fit of the BM + SA model (Adjusted_R2) was 0.7746. The XGBoost model performs better than other models (support vector machine, random forest, CatBoost, and LightBoost models). The relative importance degree of situation awareness variables, individual characteristic variables, and external environment variables to takeover time gradually reduced. Takeover time increased with the scan and gaze durations and decreased with pupil area and self-reported situation awareness scores. There was also an interaction effect between the variables to affect takeover time. Overall, the performance of the BM + SA model was better than that of the BM model. This study can provide support for predicting driver's takeover time and analyzing the mechanism of influence on takeover time. This study can provide support for the development of real-time driver's takeover ability prediction systems and optimization of human-machine interaction design in automated vehicles, as well as for the management department to evaluate and improve the driver's takeover performance in a targeted manner.


Assuntos
Condução de Veículo , Conscientização , Humanos , Condução de Veículo/psicologia , Masculino , Adulto , Feminino , Fatores de Tempo , Simulação por Computador , Adulto Jovem , Meio Ambiente , Modelos Teóricos , Automação
12.
Water Res ; 250: 121056, 2024 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-38171175

RESUMO

The dynamic changes between toxic and non-toxic strains of Microcystis blooms have always been a hot topic. Previous studies have found that low CO2 favors toxic strains, but how changing dissolved CO2 (CO2 [aq]) in water body influences the succession of toxic and non-toxic strains in Microcystis blooms remains uncertain. Here, we combined laboratory competition experiments, field observations, and a machine learning model to reveal the links between CO2 changes and the succession. Laboratory experiments showed that under low CO2 conditions (100-150 ppm), the toxic strains could make better use of CO2 (aq) and be dominant. The non-toxic strains demonstrated a growth advantage as CO2 concentration increased (400-1000 ppm). Field observations from June to November in Lake Taihu showed that the percentage of toxic strains increased as CO2 (aq) decreased. Machine learning highlighted links between the inorganic carbon concentration and the proportion of advantageous strains. Our findings provide new insights for cyanoHABs prediction and prevention.


Assuntos
Microcystis , Dióxido de Carbono , Microcistinas , Lagos , Carbono , China
13.
World J Gastroenterol ; 30(7): 631-635, 2024 Feb 21.
Artigo em Inglês | MEDLINE | ID: mdl-38515945

RESUMO

In this editorial, we comment on the article by Zhang et al entitled Development of a machine learning-based model for predicting the risk of early postoperative recurrence of hepatocellular carcinoma. Hepatocellular carcinoma (HCC), which is characterized by high incidence and mortality rates, remains a major global health challenge primarily due to the critical issue of postoperative recurrence. Early recurrence, defined as recurrence that occurs within 2 years posttreatment, is linked to the hidden spread of the primary tumor and significantly impacts patient survival. Traditional predictive factors, including both patient- and treatment-related factors, have limited predictive ability with respect to HCC recurrence. The integration of machine learning algorithms is fueled by the exponential growth of computational power and has revolutionized HCC research. The study by Zhang et al demonstrated the use of a groundbreaking preoperative prediction model for early postoperative HCC recurrence. Chall-enges persist, including sample size constraints, issues with handling data, and the need for further validation and interpretability. This study emphasizes the need for collaborative efforts, multicenter studies and comparative analyses to validate and refine the model. Overcoming these challenges and exploring innovative approaches, such as multi-omics integration, will enhance personalized oncology care. This study marks a significant stride toward precise, effi-cient, and personalized oncology practices, thus offering hope for improved patient outcomes in the field of HCC treatment.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Humanos , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/prevenção & controle , Carcinoma Hepatocelular/cirurgia , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/prevenção & controle , Neoplasias Hepáticas/cirurgia , Algoritmos , Aprendizado de Máquina , Oncologia
14.
Materials (Basel) ; 16(2)2023 Jan 09.
Artigo em Inglês | MEDLINE | ID: mdl-36676391

RESUMO

Calcium aluminate cement (CAC) has been explored as a sustainable alternative to Portland cement, the most widely used type of cement. However, the hydration reaction and mechanical properties of CAC can be influenced by various factors such as water content, Li2CO3 content, and age. Due to the complex interactions between the precursors in CAC, traditional analytical models have struggled to predict CAC binders' compressive strength and porosity accurately. To overcome this limitation, this study utilizes machine learning (ML) to predict the properties of CAC. The study begins by using thermodynamic simulations to determine the phase assemblages of CAC at different ages. The XGBoost model is then used to predict the compressive strength, porosity, and hydration products of CAC based on the mixture design and age. The XGBoost model is also used to evaluate the influence of input parameters on the compressive strength and porosity of CAC. Based on the results of this analysis, a closed-form analytical model is developed to predict the compressive strength and porosity of CAC accurately. Overall, the study demonstrates that ML can be effectively used to predict the properties of CAC binders, providing a valuable tool for researchers and practitioners in the field of cement science.

15.
Environ Sci Pollut Res Int ; 30(36): 85184-85197, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-37380860

RESUMO

In recent years, . the rapid development of the Yangtze River Delta in China has led to increasingly serious regional eco-environmental problems. Therefore, it is of great significance for the construction of ecological civilization to study the ecosystem health in the Yangtze River Delta. In this paper, the assessment framework of "Vigor-Organization-Resilience" was used to assess the ecosystem health index (EHI) of the Yangtze River Delta from 2000 to 2020, and then the spatial autocorrelation method was used to analyze the agglomeration of EHI in 314 counties in this region. Finally, the eXtreme Gradient Boosting (XGBoost) model and the SHapley Additive exPlanation (SHAP) model were combined to explore the synergistic impact of EHI driving factors. The results show that (1) from 2000 to 2020, the EHI in the Yangtze River Delta is at the level of ordinary health, and gradually decreased; (2) the EHI has significant spatial positive correlation and aggregation; (3) the driving factors in descending order of importance are urbanization level (UL), precipitation (PRE), PM2.5 (PM), normalized difference vegetation index (NDVI), and temperature (TEMP); and (4) the relationship between UL and EHI is logarithmic; PRE and EHI are quartic polynomial; PM, NDVI, TEMP, and EHI are quadratic polynomial. The results of this paper are of great significance to the management and restoration of the ecosystem in this region.


Assuntos
Ecossistema , Rios , Urbanização , Temperatura , China , Cidades
16.
Sci Total Environ ; 858(Pt 1): 159798, 2023 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-36309269

RESUMO

Considering the high toxicity of arsenic (As), its contamination of soil represents an alarming environmental and public health issue. Existing soil heavy metal concentration estimation models based on hyperspectral data ignore the spatial nonstationarity of the relationship between the soil spectrum and heavy metal concentration. A novel model (geographically weighted eXtreme gradient boosting or GW-XGBoost model) combining geographically weighted regression (GWR) method with XGBoost algorithm was proposed. The northeast district of Beijing, China, was chosen as a case study area to assess the effectiveness of the proposed model. The GW-XGBoost model was established to estimate the As concentration based on the typical spectrum of As and the spatial correlation between the spectrum and As concentration obtained using the GWR method, and the result was compared to that obtained with the XGBoost and GWR models. The accuracy of the GW-XGBoost model was obviously better than that of the other models (R2GW-XGBoost = 0.90, R2XGBoost = 0.48, and R2GWR = 0.74). Therefore, the proposed model is reliable, as it considers the spatial correlation between the spectrum and As concentration.


Assuntos
Arsênio , Metais Pesados , Solo , Monitoramento Ambiental/métodos , Regressão Espacial , China
17.
Huan Jing Ke Xue ; 44(7): 3738-3748, 2023 Jul 08.
Artigo em Chinês | MEDLINE | ID: mdl-37438273

RESUMO

Aerosol optical depths of satellites and meteorological factors have been widely used to estimate concentrations of surface particulate matter with an aerodynamic diameter ≤ 2.5 µm. Research on a high time resolution and high-precision PM2.5 concentration estimation method is of great significance for timely and accurate air quality prediction and air pollution prevention and mitigation. Himawari-8 AOD hour product and ERA5 meteorological reanalysis data were used as estimation variables, and a GTWR-XGBoost combined model was proposed to estimate hourly PM2.5 concentration in Sichuan Province. The results showed that:① the performance of the proposed combination model was better than that of the KNN, RF, AdaBoost, GTWR, GTWR-KNN, GTWR-RF, and GTWR-AdaBoost models in the full dataset; the fitting accuracy indexes R2, MAE, and RMSE were 0.96, 3.43 µg·m-3, and 5.52 µg·m-3, respectively; and the verification accuracy indexes R2, MAE, and RMSE were 0.9, 4.98 µg·m-3, and 7.92 µg·m-3, respectively. ② The model had a high goodness of fit (R2 of the whole dataset was 0.96, and R2 of different times ranged from 0.91 to 0.98) when applied to the estimation of PM2.5 concentration hour. It showed that the model had good time stability for hourly estimation and could provide accurate estimation information for regional air quality assessment. ③ In terms of time, the annual average PM2.5hourly concentration estimation showed an inverted U-shaped trend. It began to increase gradually at 09:00 am to a peak of 44.56 µg·m-3 at 11:00 and then gradually decreased. Moreover, the seasonal variation was very obvious, with winter>spring>autumn>summer. ④ In terms of spatial distribution, it showed the characteristics of high in the east and low in the west and a high degree of local pollution.

18.
Front Med (Lausanne) ; 10: 1105854, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37056727

RESUMO

Introduction: Intrinsically, chronic obstructive pulmonary disease (COPD) is a highly heterogonous disease. Several sex differences in COPD, such as risk factors and prevalence, were identified. However, sex differences in clinical features of acute exacerbation chronic obstructive pulmonary disease (AECOPD) were not well explored. Machine learning showed a promising role in medical practice, including diagnosis prediction and classification. Then, sex differences in clinical manifestations of AECOPD were explored by machine learning approaches in this study. Methods: In this cross-sectional study, 278 male patients and 81 female patients hospitalized with AECOPD were included. Baseline characteristics, clinical symptoms, and laboratory parameters were analyzed. The K-prototype algorithm was used to explore the degree of sex differences. Binary logistic regression, random forest, and XGBoost models were performed to identify sex-associated clinical manifestations in AECOPD. Nomogram and its associated curves were established to visualize and validate binary logistic regression. Results: The predictive accuracy of sex was 83.930% using the k-prototype algorithm. Binary logistic regression revealed that eight variables were independently associated with sex in AECOPD, which was visualized by using a nomogram. The AUC of the ROC curve was 0.945. The DCA curve showed that the nomogram had more clinical benefits, with thresholds from 0.02 to 0.99. The top 15 sex-associated important variables were identified by random forest and XGBoost, respectively. Subsequently, seven clinical features, including smoking, biomass fuel exposure, GOLD stages, PaO2, serum potassium, serum calcium, and blood urea nitrogen (BUN), were concurrently identified by three models. However, CAD was not identified by machine learning models. Conclusions: Overall, our results support that the clinical features differ markedly by sex in AECOPD. Male patients presented worse lung function and oxygenation, less biomass fuel exposure, more smoking, renal dysfunction, and hyperkalemia than female patients with AECOPD. Furthermore, our results also suggest that machine learning is a promising and powerful tool in clinical decision-making.

19.
ACS Synth Biol ; 11(1): 92-102, 2022 01 21.
Artigo em Inglês | MEDLINE | ID: mdl-34927418

RESUMO

Promoters are one of the most critical regulatory elements controlling metabolic pathways. However, the fast and accurate prediction of promoter strength remains challenging, leading to time- and labor-consuming promoter construction and characterization processes. This dilemma is caused by the lack of a big promoter library that has gradient strengths, broad dynamic ranges, and clear sequence profiles that can be used to train an artificial intelligence model of promoter strength prediction. To overcome this challenge, we constructed and characterized a mutant library of Trc promoters (Ptrc) using 83 rounds of mutation-construction-screening-characterization engineering cycles. After excluding invalid mutation sites, we established a synthetic promoter library that consisted of 3665 different variants, displaying an intensity range of more than two orders of magnitude. The strongest variant was ∼69-fold stronger than the original Ptrc and 1.52-fold stronger than a 1 mM isopropyl-ß-d-thiogalactoside-driven PT7 promoter, with an ∼454-fold difference between the strongest and weakest expression levels. Using this synthetic promoter library, different machine learning models were built and optimized to explore the relationships between promoter sequences and transcriptional strength. Finally, our XgBoost model exhibited optimal performance, and we utilized this approach to precisely predict the strength of artificially designed promoter sequences (R2 = 0.88, mean absolute error = 0.15, and Pearson correlation coefficient = 0.94). Our work provides a powerful platform that enables the predictable tuning of promoters to achieve optimal transcriptional strength.


Assuntos
Inteligência Artificial , Redes e Vias Metabólicas , Biblioteca Gênica , Aprendizado de Máquina , Regiões Promotoras Genéticas/genética
20.
Artigo em Chinês | MEDLINE | ID: mdl-34979613

RESUMO

Objective:To explore the combination of sinus CT score and serum allergen sIgE to construct a postoperative recurrence risk model for patients with eosinophilic CRSwNP. Methods:The clinical data of 183 patients with eosinophilic CRSwNP who were treated in Luohe Central Hospital from January 2016 to January 2019 were collected. The curative effect was evaluated one year after the operation. According to the postoperative recurrence, they were divided into recurrence group and non-recurrence group. Single factor analysis of clinical and pathological factors on the postoperative curative effect of patients, XGboost model and multivariate Cox analysis of factors affecting postoperative recurrence. Draw the receiver operating characteristic(ROC) curves of the two models to compare the prediction effects of the XGboost model. The Kaplan-Meier method draws survival curve and compares the recurrence-free survival rate of patients with different risk levels. Results:The results of Cox multivariate analysis showed postoperative adherence to comprehensive treatment, tissue EOS ratio, tissue NEU ratio, tissue lymphocyte ratio, tissue plasma cell ratio, peripheral blood NEU ratio, Allergen sIgE and total sinus CT score were independent risk factors for recurrence. The top six factors influencing postoperative recurrence in the XGboost model were allergen sIgE, total sinus CT score, tissue EOS ratio, postoperative adherence to comprehensive treatment, tissue lymphocyte ratio, and tissue plasma cell ratio. The ROC curve showed that the area under the ROC curve of the XGboost model was 0.818. Cox analysis (0.789) with more factors increased by 3.68%, and the sensitivity, specificity and Youden index of the model were significantly higher than the multivariate Cox analysis model. The factors included in the XGboost model were used to construct a postoperative recurrence risk model. The recurrence-free survival rate of high-risk group was significantly lower than that of low-risk group and intermediate-risk group (log-rank test value:21.946, P<0.001). Conclusion:The postoperative recurrence risk model established by the sinus CT score combined with serum allergen sIgE can effectively predict the incidence of postoperative recurrence in patients. The XGboost model is better than the multivariate Cox analysis model in predicting postoperative recurrence in patients with eosinophilic CRSwNP. It can be used to predict postoperative recurrence.


Assuntos
Pólipos Nasais , Rinite , Sinusite , Alérgenos , Doença Crônica , Eosinófilos , Humanos , Tomografia Computadorizada por Raios X
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa