Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
1.
BMC Med Inform Decis Mak ; 24(1): 121, 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38724966

RESUMEN

OBJECTIVE: Hospitals and healthcare providers should assess and compare the quality of care given to patients and based on this improve the care. In the Netherlands, hospitals provide data to national quality registries, which in return provide annual quality indicators. However, this process is time-consuming, resource intensive and risks patient privacy and confidentiality. In this paper, we presented a multicentric 'Proof of Principle' study for federated calculation of quality indicators in patients with colorectal cancer. The findings suggest that the proposed approach is highly time-efficient and consume significantly lesser resources. MATERIALS AND METHODS: Two quality indicators are calculated in an efficient and privacy presevering federated manner, by i) applying the Findable Accessible Interoperable and Reusable (FAIR) data principles and ii) using the Personal Health Train (PHT) infrastructure. Instead of sharing data to a centralized registry, PHT enables analysis by sending algorithms and sharing only insights from the data. RESULTS: ETL process extracted data from the Electronic Health Record systems of the hospitals, converted them to FAIR data and hosted in RDF endpoints within each hospital. Finally, quality indicators from each center are calculated using PHT and the mean result along with the individual results plotted. DISCUSSION AND CONCLUSION: PHT and FAIR data principles can efficiently calculate quality indicators in a privacy-preserving federated approach and the work can be scaled up both nationally and internationally. Despite this, application of the methodology was largely hampered by ELSI issues. However, the lessons learned from this study can provide other hospitals and researchers to adapt to the process easily and take effective measures in building quality of care infrastructures.


Asunto(s)
Neoplasias Colorrectales , Registros Electrónicos de Salud , Indicadores de Calidad de la Atención de Salud , Humanos , Neoplasias Colorrectales/terapia , Indicadores de Calidad de la Atención de Salud/normas , Países Bajos , Registros Electrónicos de Salud/normas , Sistema de Registros/normas
2.
Life (Basel) ; 14(2)2024 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-38398771

RESUMEN

Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.

4.
NPJ Sci Food ; 7(1): 46, 2023 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-37658060

RESUMEN

Ensuring safe and healthy food is a big challenge due to the complexity of food supply chains and their vulnerability to many internal and external factors, including food fraud. Recent research has shown that Artificial Intelligence (AI) based algorithms, in particularly data driven Bayesian Network (BN) models, are very suitable as a tool to predict future food fraud and hence allowing food producers to take proper actions to avoid that such problems occur. Such models become even more powerful when data can be used from all actors in the supply chain, but data sharing is hampered by different interests, data security and data privacy. Federated learning (FL) may circumvent these issues as demonstrated in various areas of the life sciences. In this research, we demonstrate the potential of the FL technology for food fraud using a data driven BN, integrating data from different data owners without the data leaving the database of the data owners. To this end, a framework was constructed consisting of three geographically different data stations hosting different datasets on food fraud. Using this framework, a BN algorithm was implemented that was trained on the data of different data stations while the data remained at its physical location abiding by privacy principles. We demonstrated the applicability of the federated BN in food fraud and anticipate that such framework may support stakeholders in the food supply chain for better decision-making regarding food fraud control while still preserving the privacy and confidentiality nature of these data.

5.
J Biomed Inform ; 143: 104404, 2023 07.
Artículo en Inglés | MEDLINE | ID: mdl-37268168

RESUMEN

A large amount of personal health data that is highly valuable to the scientific community is still not accessible or requires a lengthy request process due to privacy concerns and legal restrictions. As a solution, synthetic data has been studied and proposed to be a promising alternative to this issue. However, generating realistic and privacy-preserving synthetic personal health data retains challenges such as simulating the characteristics of the patients' data that are in the minority classes, capturing the relations among variables in imbalanced data and transferring them to the synthetic data, and preserving individual patients' privacy. In this paper, we propose a differentially private conditional Generative Adversarial Network model (DP-CGANS) consisting of data transformation, sampling, conditioning, and network training to generate realistic and privacy-preserving personal data. Our model distinguishes categorical and continuous variables and transforms them into latent space separately for better training performance. We tackle the unique challenges of generating synthetic patient data due to the special data characteristics of personal health data. For example, patients with a certain disease are typically the minority in the dataset and the relations among variables are crucial to be observed. Our model is structured with a conditional vector as an additional input to present the minority class in the imbalanced data and maximally capture the dependency between variables. Moreover, we inject statistical noise into the gradients in the networking training process of DP-CGANS to provide a differential privacy guarantee. We extensively evaluate our model with state-of-the-art generative models on personal socio-economic datasets and real-world personal health datasets in terms of statistical similarity, machine learning performance, and privacy measurement. We demonstrate that our model outperforms other comparable models, especially in capturing the dependence between variables. Finally, we present the balance between data utility and privacy in synthetic data generation considering the different data structures and characteristics of real-world personal health data such as imbalanced classes, abnormal distributions, and data sparsity.


Asunto(s)
Aprendizaje Automático , Privacidad , Humanos , Grupos Minoritarios
6.
Cancers (Basel) ; 15(12)2023 Jun 16.
Artículo en Inglés | MEDLINE | ID: mdl-37370819

RESUMEN

LARC is managed by multimodal treatments whose intensity can be highly modulated. In this context, we need surrogate endpoints to help predict long-term outcomes and better personalize treatments. A previous study identified 2yDFS as a stronger predictor of OS than pCR in LARC patients undergoing neoadjuvant RT. The aim of this pooled analysis was to assess the role of pCR and 2yDFS as surrogate endpoints for OS in a larger cohort. The pooled and subgroup analyses were performed on large rectal cancer randomized trial cohorts who received long-course RT. Our analysis focused on the evaluation of OS in relation to the pCR and 2-year disease status. A total of 4600 patients were analyzed. Four groups were identified according to intermediate outcomes: 12% had both pCR and 2yDFS (the better); 67% achieved 2yDFS but not pCR (the good); 1% had pCR but not 2yDFS; and 20% had neither pCR nor 2yDFS (the bad). The pCR and 2yDFS were favorably associated with OS in the univariate analysis, and 2yDFS maintained a statistically significant association in the multivariate analysis independently of the pCR status. The combination of the pCR and 2yDFS results in a strong predictor of OS, whereas failure to achieve 2yDFS carries a poor prognosis regardless of the pCR status. This new stratification of LARC patients could help design predictive models where the combination of 2yDFS and pCR should be employed as the primary outcome.

7.
Front Oncol ; 13: 1099994, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36925935

RESUMEN

Purpose: Artificial intelligence applications in radiation oncology have been the focus of study in the last decade. The introduction of automated and intelligent solutions for routine clinical tasks, such as treatment planning and quality assurance, has the potential to increase safety and efficiency of radiotherapy. In this work, we present a multi-institutional study across three different institutions internationally on a Bayesian network (BN)-based initial plan review assistive tool that alerts radiotherapy professionals for potential erroneous or suboptimal treatment plans. Methods: Clinical data were collected from the oncology information systems in three institutes in Europe (Maastro clinic - 8753 patients treated between 2012 and 2020) and the United States of America (University of Vermont Medical Center [UVMMC] - 2733 patients, University of Washington [UW] - 6180 patients, treated between 2018 and 2021). We trained the BN model to detect potential errors in radiotherapy treatment plans using different combinations of institutional data and performed single-site and cross-site validation with simulated plans with embedded errors. The simulated errors consisted of three different categories: i) patient setup, ii) treatment planning and iii) prescription. We also compared the strategy of using only diagnostic parameters or all variables as evidence for the BN. We evaluated the model performance utilizing the area under the receiver-operating characteristic curve (AUC). Results: The best network performance was observed when the BN model is trained and validated using the dataset in the same center. In particular, the testing and validation using UVMMC data has achieved an AUC of 0.92 with all parameters used as evidence. In cross-validation studies, we observed that the BN model performed better when it was trained and validated in institutes with similar technology and treatment protocols (for instance, when testing on UVMMC data, the model trained on UW data achieved an AUC of 0.84, compared with an AUC of 0.64 for the model trained on Maastro data). Also, combining training data from larger clinics (UW and Maastro clinic) and using it on smaller clinics (UVMMC) leads to satisfactory performance with an AUC of 0.85. Lastly, we found that in general the BN model performed better when all variables are considered as evidence. Conclusion: We have developed and validated a Bayesian network model to assist initial treatment plan review using multi-institutional data with different technology and clinical practices. The model has shown good performance even when trained on data from clinics with divergent profiles, suggesting that the model is able to adapt to different data distributions.

8.
Med Phys ; 50(2): 1044-1050, 2023 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-36493420

RESUMEN

The registration of multi-source radiation oncology data is a time-consuming and labour-intensive procedure. The standardisation of data collection offers the possibility for the acquisition of quality data for research and clinical purposes. With this study, we present an overview of the different tumour group data lists in the Dutch national proton therapy registry. Furthermore, as a representative example of the workings of these different tumour-specific knowledge graphs, we present the FAIR (Findable, Accessible, Interoperable, Reusable) data principles-compliant knowledge graph approach describing the head and neck tumour variables using radiotherapy domain ontologies and semantic web technologies. Our goal is to provide the radiotherapy community with a flexible and interoperable data model for data exchange between centres. We highlight data variables that are needed for models used in the model-based approach (MBA), which ensures a fair selection of patients that will benefit most from proton therapy.


Asunto(s)
Neoplasias , Terapia de Protones , Humanos , Países Bajos , Reconocimiento de Normas Patrones Automatizadas , Neoplasias/radioterapia , Recolección de Datos
9.
Cancers (Basel) ; 14(24)2022 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-36551602

RESUMEN

This study aims to analyze the relationship between the available variables and treatment compliance in elderly cancer patients treated with radiotherapy and to establish a decision tree model to guide caregivers in their decision-making process. For this purpose, 456 patients over 74 years of age who received radiotherapy between 2005 and 2017 were included in this retrospective analysis. The outcome of interest was radiotherapy compliance, determined by whether patients completed their scheduled radiotherapy treatment (compliance means they completed their treatment and noncompliance means they did not). A bootstrap (B = 400) technique was implemented to select the best tuning parameters to establish the decision tree. The developed decision tree uses patient status, the Charlson comorbidity index, the Eastern Cooperative Oncology Group Performance scale, age, sex, cancer type, health insurance status, radiotherapy aim, and fractionation type (conventional fractionation versus hypofractionation) to distinguish between compliant and noncompliant patients. The decision tree's mean area under the curve and 95% confidence interval was 0.71 (0.66-0.77). Although external validation is needed to determine the decision tree's clinical usefulness, its discriminating ability was moderate and it could serve as an aid for caregivers to select the optimal treatment for elderly cancer patients.

10.
J Biomed Inform ; 134: 104194, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-36064113

RESUMEN

The mining of personal data collected by multiple organizations remains challenging in the presence of technical barriers, privacy concerns, and legal and/or organizational restrictions. While a number of privacy-preserving and data mining frameworks have recently emerged, much remains to show their practical utility. In this study, we implement and utilize a secure infrastructure using data from Statistics Netherlands and the Maastricht Study to learn the association between Type 2 Diabetes Mellitus (T2DM) and healthcare expenses considering the impact of lifestyle, physical activities, and complications of T2DM. Through experiments using real-world distributed personal data, we present the feasibility and effectiveness of the secure infrastructure for practical use cases of linking and analyzing vertically partitioned data across multiple organizations. We discovered that individuals diagnosed with T2DM had significantly higher expenses than those with prediabetes, while participants with prediabetes spent more than those without T2DM in all the included healthcare categories to different degrees. We further discuss a joint effort from technical, ethical-legal, and domain-specific experts that is highly valued for applying such a secure infrastructure to real-life use cases to protect data privacy.


Asunto(s)
Diabetes Mellitus Tipo 2 , Estado Prediabético , Diabetes Mellitus Tipo 2/terapia , Costos de la Atención en Salud , Humanos , Países Bajos , Privacidad
11.
Phys Imaging Radiat Oncol ; 24: 47-52, 2022 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-36158240

RESUMEN

Background and purpose: The model based approach involves the use of normal tissue complication models for selection of head and neck cancer patients to proton therapy. Our goal was to validate the clinical utility of the related dysphagia model using an independent patient cohort. Materials and Methods: A dataset of 277 head and neck cancer (pharynx and larynx) patients treated with (chemo)radiotherapy between 2019 and 2021 was acquired. For the evaluation of the model discrimination we used statistical metrics such as the sensitivity, specificity and the area under the receiver operating characteristic curve. After the validation we evaluated if the dysphagia model can be improved using the closed testing procedure, the Brier and the Hosmer-Lemeshow score. Results: The performance of the original normal tissue complication probability model for dysphagia grade II-IV at 6 months was good (AUC = 0.80). According to the graphical calibration assessment, the original model showed underestimated dysphagia risk predictions. The closed testing procedure indicated that the model had to be updated and selected a revised model with new predictor coefficients as an optimal model. The revised model had also satisfactory discrimination (AUC = 0.83) with improved calibration. Conclusion: The validation of the normal tissue complication probability model for grade II-IV dysphagia was successful in our independent validation cohort. However, the closed testing procedure indicated that the model should be updated with new coefficients.

12.
Phys Imaging Radiat Oncol ; 22: 1-7, 2022 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-35372704

RESUMEN

Background and Purpose: Tumor recurrence, a characteristic of malignant tumors, is the biggest concern for rectal cancer survivors. The epidemiology of the disease calls for a pressing need to improve healthcare quality and patient outcomes. Prediction models such as Bayesian networks, which can probabilistically reason under uncertainty, could assist caregivers with patient management. However, some concerns are associated with the standard approaches to developing these structures in medicine. Therefore, this study aims to compare Bayesian network structures that stem from these two techniques. Patients and Methods: A retrospective analysis was performed on 6754 locally advanced rectal cancer (LARC) patients enrolled in 14 international clinical trials. Local tumor recurrence at 2, 3, and 5-years was defined as the endpoints of interest. Five rectal cancer treating physicians from three countries elicited the expert structure. The algorithmic structure was inferred from the data with the hill-climbing algorithm. Structural performance was assessed with calibration plots and area under the curve values. Results: The area under the curve for the expert structure on the training and validation data was above 0.9 and 0.8, respectively, for all the time points. However, the algorithmic structure had superior predictive performance over the expert structure for all time points of interest. Conclusion: We have developed and internally validated a Bayesian networks structure from experts' opinions, which can predict the risk of a LARC patient developing a tumor recurrence at 2, 3, and 5 years. Our result shows that the algorithmic-based structures are more performant and less interpretable than expert-based structures.

13.
Clin Transl Radiat Oncol ; 31: 93-96, 2021 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-34667884

RESUMEN

Cancer registries collect multisource data and provide valuable information that can lead to unique research opportunities. In the Netherlands, a registry and model-based approach (MBA) are used for the selection of patients that are eligible for proton therapy. We collected baseline characteristics including demographic, clinical, tumour and treatment information. These data were transformed into a machine readable format using the FAIR (Findable, Accessible, Interoperable, Reusable) data principles and resulted in a knowledge graph with baseline characteristics of proton therapy patients. With this approach, we enable the possibility of linking external data sources and optimal flexibility to easily adapt the data structure of the existing knowledge graph to the needs of the clinic.

14.
Clin Transl Radiat Oncol ; 28: 48-53, 2021 May.
Artículo en Inglés | MEDLINE | ID: mdl-33778172

RESUMEN

•Demographic features are essential for a more personalize survival prediction of spinal bone metastasis (SBM).•Women have a relatively better survival chance than men before 75 years, while men have better survival after this age.•SBM survival is not dependent on the number of spinal metastases.

15.
Arch Phys Med Rehabil ; 102(7): 1324-1330.e3, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-33711278

RESUMEN

OBJECTIVE: To explore the association between preoperative physical performance with short- and long-term postoperative outcomes in patients undergoing lumbar spinal fusion (LSF). DESIGN: Retrospective cohort. SETTING: University hospital. PARTICIPANTS: Seventy-seven patients (N=77) undergoing elective LSF were preoperatively screened on patient demographics, patient-reported outcome measures, and physical performance measures (movement control, back muscle endurance strength and extensor strength, aerobic capacity, flexibility). INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: Associations between preoperative variables and inpatient functional recovery, hospital length of stay (LOS), and 1- to 2-year postoperative pain reduction were explored using random forest analyses assessing the relative influence of the variable on the outcome. RESULTS: Aerobic capacity was associated with fast functional recovery <4 days and prolonged functional recovery >5 days (median z scores=7.1 and 12.0). Flexibility (median z score=4.3) and back muscle endurance strength (median z score=7.8) were associated with fast functional recovery <4 days. Maximum back extensor strength was associated with prolonged functional recovery >5 days (median z score=8.6). Flexibility (median z score=5.1) and back muscle endurance strength (median z score=13.5) were associated with short LOS <5 days. Aerobic capacity (median z score=8.7) was associated with prolonged LOS >7 days. Maximum back extensor strength (median z score=3.8) was associated with 1- to 2-year postoperative pain reduction and aerobic capacity (median z score=2.8) was tentative. CONCLUSIONS: Physical performance measures were associated with both short- and long-term outcomes after LSF. Adding these measures to prediction models predicting outcomes after LSF may increase their accuracy.


Asunto(s)
Vértebras Lumbares/cirugía , Aprendizaje Automático , Rendimiento Físico Funcional , Fusión Vertebral/métodos , Anciano , Estudios de Cohortes , Evaluación de la Discapacidad , Femenino , Humanos , Tiempo de Internación , Masculino , Persona de Mediana Edad , Dolor Postoperatorio , Medición de Resultados Informados por el Paciente , Periodo Posoperatorio , Valor Predictivo de las Pruebas , Periodo Preoperatorio , Recuperación de la Función , Estudios Retrospectivos , Factores de Riesgo
16.
Radiother Oncol ; 159: 183-189, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33753156

RESUMEN

BACKGROUND AND PURPOSE: Predicting outcomes is challenging in rare cancers. Single-institutional datasets are often small and multi-institutional data sharing is complex. Distributed learning allows machine learning models to use data from multiple institutions without exchanging individual patient-level data. We demonstrate this technique in a proof-of-concept study of anal cancer patients treated with chemoradiotherapy across multiple European countries. MATERIALS AND METHODS: atomCAT is a three-centre collaboration between Leeds Cancer Centre (UK), MAASTRO Clinic (The Netherlands) and Oslo University Hospital (Norway). We trained and validated a Cox proportional hazards regression model in a distributed fashion using data from 281 patients treated with radical, conformal chemoradiotherapy for anal cancer in three institutions. Our primary endpoint was overall survival. We selected disease stage, sex, age, primary tumour size, and planned radiotherapy dose (in EQD2) a priori as predictor variables. RESULTS: The Cox regression model trained across all three centres found worse overall survival for high risk disease stage (HR = 2.02), male sex (HR = 3.06), older age (HR = 1.33 per 10 years), larger primary tumour volume (HR = 1.05 per 10 cm3) and lower radiotherapy dose (HR = 1.20 per 5 Gy). A mean concordance index of 0.72 was achieved during validation, with limited variation between centres (Leeds = 0.72, MAASTRO = 0.74, Oslo = 0.70). The global model performed well for risk stratification for two out of three centres. CONCLUSIONS: Using distributed learning, we accessed and analysed one of the largest available multi-institutional cohorts of anal cancer patients treated with modern radiotherapy techniques. This demonstrates the value of distributed learning in outcome modelling for rare cancers.


Asunto(s)
Neoplasias del Ano , Carcinoma de Células Escamosas , Anciano , Neoplasias del Ano/terapia , Quimioradioterapia , Europa (Continente) , Humanos , Masculino , Países Bajos , Noruega
17.
Phys Med ; 82: 158-162, 2021 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-33639520

RESUMEN

Given the rapid growth of artificial intelligence (AI) applications in radiotherapy and the related transformations toward the data-driven healthcare domain, this article summarizes the need and usage of the FAIR (Findable, Accessible, Interoperable, Reusable) data principles in radiotherapy. This work introduces the FAIR data concept, presents practical and relevant use cases and the future role of the different parties involved. The goal of this article is to provide guidance and potential applications of FAIR to various radiotherapy stakeholders, focusing on the central role of medical physicists.


Asunto(s)
Inteligencia Artificial
18.
Radiother Oncol ; 154: 154-160, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-32966845

RESUMEN

PURPOSE: Optimal timing of surgery after neoadjuvant chemoradiotherapy (Nad-CRT) is still controversial in locally advanced rectal cancer (LARC). The primary goal of this study was to determine the best surgical interval (SI) to achieve the highest rate of pathological complete response (pCR) and secondly to evaluate the effect on survival outcomes according to the SI. PATIENTS AND METHODS: Patients data were extracted from the international randomized trials: Accord12/0405, EORTC22921, FFCD9203, CAO/ARO/AIO-94, CAO-ARO-AIO-04, INTERACT and TROG01.04. Inclusion criteria were: age≥ 18, cT3-T4 and cN0-2, no clinical evidence of distant metastasis at diagnosis, Nad-CRT followed by surgery. Pearson's Chi-squared test with Yates' continuity correction for categorical variables, the Mann-Whitney test for continuous variables, Mann-Kendall test, Kaplan-Meier curves with log-rank test, univariate and multivariate logistic regression model was used for data analysis. RESULTS: 3085 patients met the inclusion criteria. Overall, the pCR rate was 14% at a median SI of 6 weeks (range 1-31). The cumulative pCR rate increased significantly when SI lengthened, with 95% of pCR events within 10 weeks from Nad-CRT. At univariate and multivariate logistic regression analysis, lengthening of SI (p< 0.01), radiotherapy dose (p< 0.01), and the addition of oxaliplatin to Nad-CRT (p< 0.01) had a favorable impact on pCR. Furthermore, lengthening of SI was not impact on local recurrences, distance metastases, and overall survival. CONCLUSION: This pooled analysis suggests that the best time to achieve pCR in LARC is at 10 weeks, considering that the lengthening of SI is not detrimental concerning survival outcomes.


Asunto(s)
Recurrencia Local de Neoplasia , Neoplasias del Recto , Adolescente , Quimioradioterapia , Humanos , Terapia Neoadyuvante , Recurrencia Local de Neoplasia/patología , Estadificación de Neoplasias , Ensayos Clínicos Controlados Aleatorios como Asunto , Neoplasias del Recto/patología , Resultado del Tratamiento
19.
Sci Rep ; 10(1): 20526, 2020 11 25.
Artículo en Inglés | MEDLINE | ID: mdl-33239719

RESUMEN

The difference in incidence of oral cavity cancer (OCC) between Taiwan and the Netherlands is striking. Different risk factors and treatment expertise may result in survival differences between the two countries. However due to regulatory restrictions, patient-level analyses of combined data from the Netherlands and Taiwan are infeasible. We implemented a software infrastructure for federated analyses on data from multiple organisations. We included 41,633‬ patients with single-tumour OCC between 2004 and 2016, undergoing surgery, from the Taiwan Cancer Registry and Netherlands Cancer Registry. Federated Cox Proportional Hazard was used to analyse associations between patient and tumour characteristics, country, treatment and hospital volume with survival. Five factors showed differential effects on survival of OCC patients in the Netherlands and Taiwan: age at diagnosis, stage, grade, treatment and hospital volume. The risk of death for OCC patients younger than 60 years, with advanced stage, higher grade or receiving adjuvant therapy after surgery was lower in the Netherlands than in Taiwan; but patients older than 70 years, with early stage, lower grade and receiving surgery alone in the Netherlands were at higher risk of death than those in Taiwan. The mortality risk of OCC in Taiwanese patients treated in hospitals with higher hospital volume (≥ 50 surgeries per year) was lower than in Dutch patients. We conducted analyses without exchanging patient-level information, overcoming barriers for sharing privacy sensitive information. The outcomes of patients treated in the Netherlands and Taiwan were slightly different after controlling for other prognostic factors.


Asunto(s)
Neoplasias de la Boca/epidemiología , Privacidad , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Análisis Multivariante , Países Bajos/epidemiología , Pronóstico , Modelos de Riesgos Proporcionales , Análisis de Regresión , Análisis de Supervivencia , Taiwán/epidemiología
20.
Med Phys ; 47(11): 5931-5940, 2020 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-32521049

RESUMEN

PURPOSE: One of the most frequently cited radiomics investigations showed that features automatically extracted from routine clinical images could be used in prognostic modeling. These images have been made publicly accessible via The Cancer Imaging Archive (TCIA). There have been numerous requests for additional explanatory metadata on the following datasets - RIDER, Interobserver, Lung1, and Head-Neck1. To support repeatability, reproducibility, generalizability, and transparency in radiomics research, we publish the subjects' clinical data, extracted radiomics features, and digital imaging and communications in medicine (DICOM) headers of these four datasets with descriptive metadata, in order to be more compliant with findable, accessible, interoperable, and reusable (FAIR) data management principles. ACQUISITION AND VALIDATION METHODS: Overall survival time intervals were updated using a national citizens registry after internal ethics board approval. Spatial offsets of the primary gross tumor volume (GTV) regions of interest (ROIs) associated with the Lung1 CT series were improved on the TCIA. GTV radiomics features were extracted using the open-source Ontology-Guided Radiomics Analysis Workflow (O-RAW). We reshaped the output of O-RAW to map features and extraction settings to the latest version of Radiomics Ontology, so as to be consistent with the Image Biomarker Standardization Initiative (IBSI). Digital imaging and communications in medicine metadata was extracted using a research version of Semantic DICOM (SOHARD, GmbH, Fuerth; Germany). Subjects' clinical data were described with metadata using the Radiation Oncology Ontology. All of the above were published in Resource Descriptor Format (RDF), that is, triples. Example SPARQL queries are shared with the reader to use on the online triples archive, which are intended to illustrate how to exploit this data submission. DATA FORMAT: The accumulated RDF data are publicly accessible through a SPARQL endpoint where the triples are archived. The endpoint is remotely queried through a graph database web application at http://sparql.cancerdata.org. SPARQL queries are intrinsically federated, such that we can efficiently cross-reference clinical, DICOM, and radiomics data within a single query, while being agnostic to the original data format and coding system. The federated queries work in the same way even if the RDF data were partitioned across multiple servers and dispersed physical locations. POTENTIAL APPLICATIONS: The public availability of these data resources is intended to support radiomics features replication, repeatability, and reproducibility studies by the academic community. The example SPARQL queries may be freely used and modified by readers depending on their research question. Data interoperability and reusability are supported by referencing existing public ontologies. The RDF data are readily findable and accessible through the aforementioned link. Scripts used to create the RDF are made available at a code repository linked to this submission: https://gitlab.com/UM-CDS/FAIR-compliant_clinical_radiomics_and_DICOM_metadata.


Asunto(s)
Metadatos , Bases de Datos Factuales , Alemania , Humanos , Reproducibilidad de los Resultados , Flujo de Trabajo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...