RESUMO
Heavy metal (HM) contamination in agricultural soils has been a serious environmental and health problem in the past decades. High concentration of HM threatens human health and can be a risk factor for many diseases such as stomach cancer. In order to investigate the relationship between HM content and stomach cancer, the under-study area should be adequately large so that the possible relationship between soil contamination and the patients' distribution can be studied. Examining soil content in a vast area with traditional techniques like field sampling is neither practical nor possible. However, integrating remote sensing imagery and spectrometry can provide an unexpensive and effective substitute for detecting HM in soil. To estimate the concentration of arsenic (As), chrome (Cr), lead (Pb), nickel (Ni), and iron (Fe) in agricultural soil in parts of Golestan province with Hyperion image and soil samples, spectral transformations were used to preprocess and highlight spectral features, and Spearman's correlation was calculated to select the best features for detecting each metal. The generalized regression neural network (GRNN) was trained with the chosen spectral features and metal containment, and the trained GRNN generated the pollution maps from the Hyperion image. Mean concentration of Cr, As, Fe, Ni, and Pb was estimated at 40.22, 11.8, 21,530.565, 39.86, and 0.5 mg/kg, respectively. Concentrations of As and Fe were near the standard limit and overlying the pollution maps, and patients' distribution showed high concentrations of these metals can be considered as stomach cancer risk factors.
Assuntos
Metais Pesados , Tecnologia de Sensoriamento Remoto , Poluentes do Solo , Neoplasias Gástricas , Humanos , Arsênio/análise , China/epidemiologia , Monitoramento Ambiental/métodos , Chumbo/análise , Metais Pesados/análise , Níquel/análise , Medição de Risco , Solo/química , Poluentes do Solo/análise , Neoplasias Gástricas/epidemiologiaRESUMO
PURPOSE: Gastric cancer (GC) ranks as the 7th most common cancer worldwide and a leading cause of cancer mortality. In Iran, stomach malignancies are the most common fatal cancers with higher than world average incidence. In recent years, methods like machine learning that provide the opportunity of merging health issues with computational power and learning capacity have caught considerable attention for prediction and diagnosis of diseases. In this study, we aimed to model GC data to find risk factors and identify GC cases in Golestan Cohort Study (GCS), using gradient boosting as a machine learning technique. METHODS: Since the GC class (280) was smaller than not-GC (49,467), "Synthetic Minority Oversampling Technique" was used to balance the dataset. Seventy percent of the data was used to train the gradient boosting algorithm and find effective factors on gastric cancer, and the remaining 30% was used for accuracy assessment. RESULTS: Our results indicated that out of 19 factors, age, social economical status, tea temperature, body mass index, gender, and education were the top six effective factors with impact rates of 0.24, 0.16, 0.13, 0.13, and 0.07, respectively. The trained model classified 70 out of 72 GC patients in the test set, correctly. CONCLUSION: The results indicate that this model can effectively detect gastric cancer (GC) by utilizing important risk factors, thus avoiding the need for invasive procedures. The model's performance is reliable when provided with an adequate amount of input data, and as the dataset expands, its accuracy and generalization improve significantly. Overall, the trained system's success stems from its ability to identify risk factors and identify cancer patients.